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In this paper a multi-band envelope-function Hamiltonian for lattice-matched semiconductor het- 
erostructures is derived from first-principles self-consistent norm-conserving pseudopotentials. The 
theory is applicable to isovalent or heterovalent heterostructures with macroscopically neutral in- 
terfaces and no spontaneous bulk polarization. The key assumption — proved in earlier numerical 
studies — is that the heterostructure can be treated as a weak perturbation with respect to some 
periodic reference crystal, with the nonlinear response small in comparison to the linear response. 
Quadratic response theory is then used in conjunction with k • p perturbation theory to develop a 
multi-band effective-mass Hamiltonian (for slowly varying envelope functions) in which all interface 
band-mixing effects are determined by the linear response. To within terms of the same order as the 
position dependence of the effective mass, the quadratic response contributes only a bulk band offset 
term and an interface dipole term, both of which are diagonal in the effective-mass Hamiltonian. 
The interface band mixing is therefore described by a set of bulk-like parameters modulated by a 
structure factor that determines the distribution of atoms in the heterostructure. The same linear 
parameters determine the interface band-mixing Hamiltonian for slowly varying and (sufficiently 
large) abrupt heterostructures of arbitrary shape and orientation. Long-range multipole Coulomb 
fields arise in quantum wires or dots, but have no qualitative effect in two-dimensional systems 
beyond a dipole contribution to the band offsets. The method of invariants is used to determine 
the explicit form of the Hamiltonian for Fe and T% states in semiconductors with the zinc-blende 
structure, and for intervalley mixing of F and X electrons in (001) GaAs/AlAs heterostructures. 
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I. INTRODUCTION 
A. Background and motivation 

Envelope-function models continue to play a key role 
in the design and interpretation of experiments on semi- 
conductor heterostructures. The canonical "envelope- 
function approximation," which by definition makes use 
of only bulk effective-mass parameters and heterojunc- 
tion band offsets, has been very successful in explaining 
a wide range of experiments^^ However, recent studies 
have increasingly emphasized interface-related effects ly- 
ing outside the scope of conventional envelope-function 
theory, such as optical 3 * 4 *^ and electrical 6 * 7 ^ anisotropy, 
intervalley mixingjSiifliiLi 2 *- 3 *- 4 *- 5 . and spin polariza- 
tion phenomena, 16 ! 17 ! 18 ! 19 . 20 ! 21 . 22 ! 23 ! 24 ! 25 ! 26 ! 27 These ef- 
fects are generated (wholly or in part) by interface band- 
mixing terms in the heterostructure Hamiltonian, such 
as valence-band mixing of light and heavy holes 
T-X coupling, 10 '—: and the conduction or valence band 
Rashba coupling. 31 

A great deal of progress in the modeling of these ef- 
fects can be made on the basis of symmetry informa- 
tion alone. The method of invariants, which was orig- 
inally developed for bulk semiconductors) 3 ^ 3 * 3 ^ 5 " has 
proved a powerful tool in the study of heterostructures 

ag „ T u 6-15. 16. 17.18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 36. 37.38 

The standard method of invariants uses symmetry in- 
formation to construct an explicit interface Hamiltonian, 
but the same information (supplemented by hermiticity 
or current-conservation requirements) may also be used 



to construct connection rules for the envelope functions 
on opposite sides of the interfacei 39 > 40 i 41 i 42 ' 43 i 44 > 45 i 46 In 
either case, one obtains a phenomenological model con- 
taining some interface parameters whose value is deter- 
mined by comparison with experiment. 

Useful as this approach may be, it does not provide 
any information about the magnitude of the interface 
parameters, some of which (for reasons unrelated 
to symmetry) may happen to be zero or negligibly 
small. To obtain this information (along with a deeper 
understanding of the physical origin of the interface phe- 
nomena), one must turn to a more detailed microscopic 
model of the interface. Thus, numerous envelope- 
function models have been derived directly from the 
microscopic potential energy; these include interface 
Hamiltonia na 47 - 48 - 49 - 50 - 51 - 52 - 53 - 54 - 55 - 56 - 57 - 58 - 59 - 60 - 61 - 62 - 63 - 64 - 65 - 66 - 67 - 68 - 69 -' 
and connection ruleo 72 ! 73 ! 74 ! 75 ! 76 ! 77 ! 78 ! 79 ! 80 ! 81 ! 82 as well 
as numerical approaches based on solving the k • p 
equations in momentum spaceSSi 

However, all of the cited derivations are based on em- 
pirical pseudopotentials, in which the potential energy 
is determined by the choice of some specific model for 
the interface. This leads to ambiguity in the results, as 
different choices may yield conflicting predictions as to 
which interface parameters are important;* 2 ** 3 ** 4 * 6 * 1 * 6 - 4 ^ 
or whether interface band-mixing effects are directly re- 
lated to band offsets ^21^^ Such conflicts can only be 
resolved by deriving the envelope-function Hamiltonian 
from an ab initio self-consistent potential, as suggested by 
Sham and LuiS 7 . The purpose of this paper is to present 
such a derivation and examine its implications for inter- 
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face band-mixing effects. Numerical applications of the 
theory are not considered here. A preliminary account of 
these results has been presented elsewhere^ 



B. Basic assumptions and limitations 

The current "standard model" for condensed-matter 
physics is based on density-functional theory in the lo- 
cal density approximation (LDA) ,SL22i2i However, it is 
well known that this ground-state formalism does not 
accurately predict energy gaps in semiconductors. Thus, 
the present state-of-the-art in band-structure theory in- 
volves calculations of the electron self-energy in the 
shielded-interaction 92-93 or GW approximationiSli2Si2Si21 
This field is not yet fully mature, as apparent early 
successes22i22iiffl have been questioned in light of recent 
developmentsiifli*ifl2i 1 i£i±24i±2& Nevertheless, any future 
refinements in approximation techniques for calculating 
the quasiparticle band structure will continue to be based 
on the self-energy in Dyson's equation. For this reason, 
the present work relies only upon general properties of 
the self-energy operator, not upon any specific approxi- 
mation to this quantity. 

For simplicity, however, this paper assumes that the 
ionic cores are described in terms of norm-conserving 
pseudopotentialsi 1 Wi 1 QTi 1 99i 1 9?i 11 9 The projector aug- 
mented wave (PAW) method of Blochl t 1( i ) ?i 1 ^i 111 which 
does not involve this approximation, has become in- 
creasingly popular in recent years. A generalization of 
the present theory to incorporate the PAW formalism 
should be possible, but to avoid undue complexity it is 
not considered here. In addition, this paper considers 
only lattice-matched systems (with no bulk or interface 
strain). 

The key assumption used in the present work is 
that the heterostructure is a small perturbation^ 
with respect to some periodic virtual reference crys- 
tal (such as Alo.5Gao.5As for a GaAs/AlAs heterostruc- 
ture). This assumption has been verified within LDA 
for both isovalent and heterovalent systems, includ- 
ing GaAs/AlAs, 113 ! 114 Ge/GaA S| 114 i 115 Ino.53Gao.47As/- 
InP, 114 i 116 GaAs/Si/AlAs and GaAs/Ge/AlAs t 117 '" 8 
Si/Ge, 119 i 120 and InAs/GaSb, 121 i 122 Within the frame- 
work of pseudopotential theory, one would expect it 
to be valid for other similar heterostructures also. 
This assumption makes it possible to describe the self- 
consistent heterostructure perturbation in terms of non- 
linear response theory, with the linear response pre- 
dominant and the quadratic response providing a weak 
;;rr:cticn. 113 - 114 - 11R - 116 - 117 - 118 - 119 - 12n - 121 - 122 

Such an approach has been used by Sham 1 ^ 3 - to derive 
an effective-mass equation for shallow impurity states in 
semiconductors. Sham's work was extended in the pre- 
ceding pape»i2i to obtain expressions for the self-energy 
at small values of the crystal momentum in lattice- 
matched heterostructures described by spin-dependent 
nonlocal pseudopotentials. These expressions are used 



here to construct a multi-band effective-mass theory for 
lattice- matched heterostructures. 

Although the basic formalism developed here is quite 
general, to keep the paper to a reasonable length it is nec- 
essary to impose some restrictions on the material sys- 
tems that are treated in detail. (This allows the power 
series in Sec.^to be terminated at a reasonably low or- 
der.) Since only lattice-matched systems are considered 
here, it is assumed that the symmetry of the reference 
crystal does not support a spontaneous polarization, be- 
cause that would generate macroscopic electric fields and 
piezoelectric strain fields inconsistent with the lattice- 
matching assumption. Therefore, the present theory is 
not directly applicable to wurtzite materials. 

In addition, it is assumed that the interfaces in a het- 
erovalent system such as Ge/GaAs are macroscopically 
neutraljii 4 ^^ so that the atoms can be grouped together 
into neutral clusters (of fractional atoms) that carry a 
dipole moment only at the interface Thus, the 
main application of the present theory would be to isova- 
lent or heterovalent heterostructures of semiconductors 
with the zinc-blende or diamond structure. However, the 
only examples treated explicitly here are from isovalent 
systems. 

Lattice mismatch could not be included in this the- 
ory without fundamental changes to account for atomic 
relaxation. Nevertheless, the present results provide a 
solid foundation for subsequent extensions of the theory 
to include this effect. Other restrictions, such as macro- 
scopic neutrality and the lack of bulk dipole terms, could 
be lifted merely by extending some power series expan- 
sions to higher order (although, depending on the accu- 
racy that is desired^* 1 this may require th e inc lusion of 
certain nonanalytic terms neglected in Ref. Il24j) . 

The fact that (within the pseudopotential approx- 
imation) a typical heterostructure is a weak pertur- 
bation 112 . 113 i 114 . 115 i 116 . 117 i 118 i 119 i 120 i 121 i 122 makes possi- 
ble the existence of energy eigenstates \ip) whose wave 
function (nk|-0) in the Luttinger-Kohn representationiSl 
|nk) of the reference crystal is negligible outside a small 
region in k space near the high-symmetry points of the 
Brillouin zone ,i22ii2iLi22ii2i These eigenstates are just the 
low-energy excitations that are of greatest interest exper- 
imentally. Their existence, which has been verified by 
extensive numerical work on empirical pseudopotential 
modelsjiSSiiSiLiiffiiiSi is precisely the condition needed for 
the validity of an effective-mass approximation for slowly 
varying envelope functions F n (x) ^^^^^^^2^ since 
F n (k) = (?ik I ip). The existence of slowly varying 
envelopea 50 i 51 i 128 i 129 i 130 i 131 provides the foundation for 
the effective-mass theory derived here. 

The present approach, based on quadratic response 
theoryjiSi fits well with the k • p perturbation formal- 
ism of Leibler, 47 i 48 i 65 i 66 i 67 i 68 i 69 in which the heterostruc- 
ture perturbation is treated as small in comparison to 
the energy separating the bands of interest from other 
remote bands. Leibler's theory is used here to develop a 
multi-band effective-mass theory that includes all terms 
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of the same order as the position dependence of the ef- 
fective mass. This includes cubic and quartic disper- 
sion terms in the Hamiltonian of the reference crystal, 
as well as the leading contributions from the quadratic 
response. As shown in Sec. [H] higher-order terms can- 
not consistently be described in terms of local differential 
equations SMMLMM 



C. Summary of key results 

In this paper, it is shown that the dominant interface 
band-mixing terms are those arising from the linear re- 
sponse. Indeed, the linear response is the only contri- 
bution to band mixing that fits within the framework 
of the perturbation theory defined above, and the only 
contribution that can consistently be described in terms 
of local differential equations. To this level of accuracy, 
the quadratic response contributes only a bulk band off- 
set term and an interface dipole term, neither of which 
produces any band mixing in the effective-mass Hamil- 
tonian. 

This represents a major simplification, since it implies 
that the interface band-mixing terms in the Hamiltonian 
are just a superposition of parameters derived from the 
linear response to individual ionic perturbations (or neu- 
tral cluster perturbation9i2i±2£ in the case of heterova- 
lent substitutions). These parameters are calculated once 
and for all for a given material system; they then appear 
as coefficients in front of structure factors describing the 
distribution of atoms in the heterostructure. The Hamil- 
tonian contains spatial derivatives of (and differences 
between) these structure factors, which generate (5-like 
functions at an interface. The atomic distribution func- 
tions change for heterostructures of different shape (wells, 
wires, or dots) and orientation, but their coefficients do 
not. Thus, a single set of interface parameters governs 
the band mixing for any type of heterostructure^^ in a 
given material system. 

If desired, one can express these linear parameters as 
a difference between the properties of the various bulk 
materials that make up the heterostructure. (A simi- 
lar result was obtained for linear band offsets in Refs. 
HT3ll4lll. t lllflll7lll8lliai2ril21ll22l ') However, in a 
no-common-atom system such as InAs/GaSb, this must 
include the (lattice-matched) "interface" materials GaAs 
and InSb. Also, these bulk-like linear response pa- 
rameters (which in principle do not require a supercell 
calculation-^) cannot be determined from experiments 
on bulk semiconductors. 

For some terms in the linear response (derived from the 
analytic part of the self-energy«i2i) , the interface Hamil- 
tonian itself has the form of a macroscopic average of 
the Dirac S function or its derivatives. For the remain- 
ing terms (derived from the nonanalytic part of the self- 
energy 1 - 4 ), the effective charge density has this local- 
ized form, but the potential energy is not as well local- 
ized, having the form of a long-range multipole potential. 



However, in a quantum well or any other heterostructure 
with two-dimensional translation symmetry, all terms are 
well localized except for the interface dipole terms (which 
merely modify the band offsets). 

The band-mixing Hamiltonian has the same general 
form for slowly graded structures (within the virtual crys- 
tal approximation) and abrupt heterojunctions, the only 
difference being the rate of change of the structure factor. 
That is, in contrast to previous theories based on model 
potentials or empirical pseudopotentials, no new band- 
mixing parameters appear at an abrupt junction. Indeed, 
the localized interface terms in the Hamiltonian derived 
here are qualitatively identical to the Hamiltonian de- 
rived by LeibleB 4 ^ 4 ^ for slowly graded heterostructures. 

These results shed light on recent 
suggestiona 12 i 13 i 14 i 64 i 69 that the T x -X lz and X lx - 
X\ y intervalley mixing potentials at an ideal (001) 
GaAs/AlAs hetero junction should be proportional to 
5(z). It is shown here that such mixing arises only from 
the quadratic response and is therefore negligible in 
comparison to Ti—X^z, X\ Z ~X^ Z , and X^ x -X^ y mixing. 
There is, however, a linear contribution to the T\-Xi z 
and X\ x -Xi y mixing that is proportional to 5'(z), as well 
as a linear 8{z) mixing for nonideal interfaces or when 
spin-orbit coupling is included. These contributions may 
help to explain the experimental observations in Refs. 

OiiTi 



D. Outline of the paper 

The paper begins in Sec. ^ with a discussion of which 
terms are to be retained in k • p perturbation the- 
ory, based upon a review of important recent work by 
Takhtamirov and Volkovi 65 i 66 i 67 i 68 i 69 The main results of 
the quadratic response theory developed in the preceding 
papeniSi are presented (in modified form) in Sec. lIIII The 
basic envelope-function formalism is developed in Sec. 
II VI while Sec. |V] describes the power series expansions 
that are used to obtain approximate expressions valid 
for slowly varying envelopes. Perturbation theory is used 
to eliminate the coupling to "remote" bands in Sec. IVII 
yielding the basic expression for the multi-band envelope- 
function Hamiltonian. Modifications to the Hamiltonian 
that are necessary if one wishes to describe the mate- 
rial parameters of an abrupt heterojunction as piecewise 
constant are discussed in Sec. IVIII Symmetry properties 
of the Hamiltonian are discussed in Sec. IVIII1 where ex- 
plicit matrix representations of the material properties 
are given for semiconductors with the zinc-blende struc- 
ture. Finally, the significance of the results and their 
relation to previous work in the literature are discussed 
in Sec. 
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II. ORDER OF TERMS INCLUDED 

Takhtamirov and Volkov^^^i^?? have recently 
demonstrated (using an instructive analogy to the lead- 
ing relativistic corrections to the Schrodinger equation) 
that most derivations of heterostructure effective-mass 
equations in the literature (including those of the present 
author) do not consistently include all perturbative cor- 
rections of the same order. This section presents a review 
and discussion of their results, with the objective of es- 
tablishing which terms are to be retained in subsequent 
perturbative approximations. 

The first case to be considered is a wide-gap system 
satisfying 



AV <£, 



(2.1) 



where AV^ is a typical heterojunction band offset, and 
Eg is a typical energy gap (for the virtual bulk reference 
crystal) separating the band in question from all other 
"remote" bands. It is assumed that in the reference crys- 
tal this band is describable by an effective-mass equation 
with effective mass m* . This will be the case if the k • p 
interaction with remote bands 



hkp 



EE (k\)E s 



(2.2) 



can be treated as a small perturbation. Here m is 
the free-electron mass, k ~ 2ir/L is a typical envelope 
function wavenumber for a quantum well of width L, 
p ~ 2ith/a is a typical interband momentum matrix 
element (where a is the lattice constant), and it is as- 
sumed that L ^S> a. Equation 12. 2|) defines a length pa- 
rameter A = hp/mEg, which (if the free-electron con- 
tribution to m* is negligible) may also be written as 
A ~ h(2m*E g )~ 1 / 2 . For GaAs, both expressions for A 
give A~6 A, thus A ~ a and fcA <C 1 in a wide quantum 
well. For the remainder of this paper the parameters A 
and a are used interchangeably, although in general er- 
ror estimates should be based on the larger of the two 
quantities. 

In general one is interested in cases where the kinetic 
energy is comparable to the band offset: 



2 1,2 



h 2 k 

2m* 



AV ~ (kX) 2 Eg 



(2.3) 



This is the order of terms included in ordinary effective- 
mass theory^ 7 , which may be compared with the nonrel- 
ativistic (Schrodinger) approximation to the Dirac equa- 
tion. The position dependence of the effective mass can 
be calculated by treating AV/E g as a perturbation; 47 ! 48 
yielding a correction of order 



h 2 k 2 (AV\ 
\Eg~) 



(Ml 

E„ 



(k\) 2 AV, 



(2.4) 



which is of the same order as the nonparabolic k 4 terms in 
the kinetic energy of the reference crystal. Hence, these 



terms (analogous to the relativistic mass corrections to 
the Schrodinger equation) must be included if one is to 
retain all terms of the same order as the position depen- 
dence of the effective mass. 

As will be seen below, the interface band-mixing terms 
proportional to a S function are of order 



A(S(z)) ~ (ka)AV, 



(2.5) 



while those proportional to the derivative of a S function 
are of order 



B(S'(z)) ~ (ka) 2 AV, 



(2.6) 



in which A and B are constants. Hence, the latter terms 
are also comparable to the position dependence of the 
effective mass. 

For a step-function discontinuity AV in the potential 
energy, the envelope function F has a discontinuity in its 
second derivative given by 



AF" 2m* 



F 



h 2 



-AV, 



(2.7) 



where F ~ L -1 / 2 . This gives rise to an asymptotic be- 
havior in k space of F(k) ~ (AF")fc~ 3 . But in the exact 
Luttinger-Kohn envelope-function representation^ 7 , the 
envelope functions are limited to wave vectors inside the 
first Brillouin zone. Hence, the use of a local differential 
effective-mass equation, which gives rise to nonvanishing 
F(k) outside the Brillouin zone, generates an error in the 
kinetic energy of order 



(ka) 



h 2 k 2 
2m* 



{kafAV. 



(2.8) 



Therefore, in the local approximation, the accuracy is 
limited by Eq. i|2.8J) . and contributions beyond the level 
of (|2.4|l and (|2.tj|) should for consistency be omitted. 

Of course, this is not a fundamental limitation of the 
envelope-function method, as one can always choose to 
work in k space (which is a common choice for numerical 
worki2iiSii^&) . However, there is another similar source 
of error that arises from anticrossings of the bands explic- 
itly included in the envelope-function model with those 
treated as remote perturbations. This occurs, for exam- 
ple, in the Tq conduction band of GaAs at about a third 
of the distance to the Brillouin zone boundary in the A 
direction. Beyond such an anticrossing the model is no 
longer valid and an error of order (|2.8() arises even in a 
k-space formalism. To eliminate such errors one must en- 
large the Hamiltonian by treating these remote bands ex- 
plicitly, thus obtaining a full-zone k- p model. 74,83 - 137 - 138 

Such a multi-band envelope-function model is also 
needed for medium-gap (AV ~ Eg ' 1 ) and narrow-gap 

(-Eg "* <C AV) systems, where E^ is the energy gap of 
the reference crystal. The perturbative approach based 
on Eq. (|2.1|l can still be used, provided that E g refers 
to the energy gap between the bands of interest and 
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those treated as remote perturbations. Takhtamirov 
and VolkovSi have considered the extreme narrow-gap 
limit for the case in which the dispersion is dominated 
by linear- k terms for all energy ranges of interest, and 
showed that in this case the error generated by the local 
approximation is of order (ka) 2 AV. In such cases the 
terms <|2.4|) and l|2.6|l should be omitted for consistency. 
However, in practical situations one is more likely to en- 
counter cases in which AV ~ Eg and the k 2 dispersion 
terms are comparable to the linear-fc terms. Thus, in 
this paper all terms of order (ka) 2 AV are retained, while 
those at the level of the local approximation 1|2.8[) are 
omitted. 

It should be noted that a fully self-consistent pertur- 
bation scheme is not always desirable. It would of course 
be impossible to achieve an accurate numerical predic- 
tion for the fine structure of the hydrogen atom without 
including the relativistic-mass correction and the Darwin 
term in addition to the spin-orbit splitting. However, for 
qualitative considerations one is often interested primar- 
ily in symmetry-breaking effects, in which case the for- 
mer two contributions may justifiably be omitted. Like- 
wise, although the present work retains all terms of order 
(ka) 2 AV, certain of these (such as the k 4 bulk disper- 
sion terms) may possibly be omitted for applications in 
which the primary focus is on symmetry-breaking inter- 
face effects. [As an example, Takhtamirov and Volkov 
have proposed a model 6 based on certain terms of order 
(ka) 3 AV, while neglecting larger terms of order (ka)AV 
and (ka) 2 AV.) However, the neglect of these terms can 
only be justified for specific individual applications and 
should not be presumed to hold in general. 

The discussion here has focused on wide quantum 
wells, but this should not be taken to imply that effective- 
mass theory is inapplicable in other cases. For example, 
in a narrow quantum well^2*S an effective-mass approx- 
imation can be developed along the same lines as for 
shallow impurities>iSii2i In this case, however, the above 
estimates of the interface terms and local approximation 
are no longer valid; see Ref. |H for further details. 

The remainder of this paper uses atomic units with 
h = m = e = 1 . 



III. QUADRATIC RESPONSE THEORY 

A. Basic definitions 

The problem of interest is the Dyson eigenvalue equa- 
tion (see Ref. Il24l and references therein) 

-±V 2 i/>(x,w)+ J F(x,x',w)^(x»£lV = E(u))ip(pt,w), 

(3.1) 

in which lo is a complex energy parameter, E is the com- 
plex eigenvalue, and the self-consistent potential V is the 
sum of a fixed norm-conserving ionic pseudopotential V\ on 



and the nonhermitian self-energy operator S: 

V(x, x', u) = M on (x, x') + E(x, x', u). (3.2) 

For notational simplicity, the w dependence will be sup- 
pressed in most of the equations that follow. In Eq. (|3.1|1 . 
-0 and V are spinors, with V having the form 

V = 1 V sc + a • V, (3.3) 

in which 1 is the 2x2 spinor unit matrix, V sc is the 
scalar relativistic part of V , cr is the Pauli matrix, and V 
accounts for spin-orbit couplingfi22iA22iIiiiA2£ Note that 
Eq. 1)3.1(1 incorporates all relativistic corrections of order 
Z 2 a 2 (where Z is the atomic number and a is the fine- 
structure constant), but neglects terms of order a 2 , such 
as spin-orbit coupling outside the atomic coresii ^^^ 

In a heterostructure, it is convenient to partition the 
ionic pseudopotential as 

M on (x,x') = (*,*') + AMon(x,x'), (3.4) 

where is the ionic pseudopotential of some periodic 
reference crystal (which may be a virtual crystal). This 
is defined as 

V^(x,x') = /X^-Ri.x'-Ri), (3.5) 

in which W;° n (x, x') is the ionic pseudopotential for 
atomic species a, f a; > is the fractional weight associated 
with atomic a on site j at position Tj in the unit cell 
of the reference crystal, and Rj = R + Tj , where R is a 
Bravais lattice vector of the reference crystal. Z - 7 must 
satisfy 

0</ QJ <l, ^/ aj = l, (3.6) 

a 

although the former constraint need not be strictly en- 
forced. 

The term AV; on is the perturbation due to the hetero- 
structure: 

A^ on (x, x') = <>n <„(x - R„ x' - Rj)- (3-7) 

a,j,R 

Here is the change in fractional weight (relative to 
the reference crystal) of atom a at position Rj in the 
heterostructure, which must satisfy 

< f a3 + 6^ < 1, J2 6 t = 0- ( 3 - 8 ) 

a 

The constraint (|3.8|l permits one to rewrite AV[on as 
AT4on(x,x') = ^'^AtC(x-R Q ,x'-R a ), (3.9) 

in which 

A< n (x, x') = < on (x, x') - <j,(x, x'). (3.10) 
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Here a = (a, j) is a composite index, dj is the label of 
some given atom on site j, and the prime on the sum- 
mation symbol indicates that the values a = (dj,j) are 
excluded. To simplify the interpretation of (|3.9[l . the 
atom dj is chosen to be the same as the (virtual) atom 
on site j in the reference crystal. 

In this paper the self-consistent potential V(x, x') is 
treated using nonlinear response theory^* The funda- 
mental assumption is that V can be expressed as a power 
series in the variables 6* R : 

y(x,x') =F (0) (x,x') + ^ (1) (x,x') + y (2) (x,x') + -- - . 

(3.11) 

Here (x, x') is the self-consistent potential of the ref- 
erence crystal, which has the same periodicity as the ionic 
potential 13. ty : 



0°)(x,x') = V (0) (x + R,x' + R). 



(3.12) 



The linear response to the heterostructure perturbation 
has the form 



^ 1 )(x,x')= 5>£A^(x,x'), 



(3.13) 



a,R 



while the quadratic response is 



T^(x,x') 



£'£' 

a,R a',R' 



'r<A Vrr ,(x,x'). (3.14) 



Here the expansion coefficients in the power series are 
defined by 



A« R (x,x') 
A^(x,xO 



S6» R ' 

i9 2 y( x ,x') 

2 d6« <96> R ' ' 



(3.15) 



where the derivatives are evaluated with respect to the 
reference crystal. These derivatives can be evaluated nu- 
merically by applying perturbations l|3.9l) with small val- 
ues of # R . 

The leading source of error in quadratic response the- 
ory lies in the neglected cubic response term T^*- 3 '. This 
error can be reduced somewhat by a suitable choice of 
reference potential. For example, in a GaAs/AlAs het- 
erostructure, choosing Alo.5Gao.5As as a reference crystal 
instead of GaAs would reduce the cubic error in AlAs by 
a factor of 8. However, it would also create a compa- 
rable error in GaAs. Thus, the cubic error in the inter- 
face Hamiltonian for an Alo.5Gao.5As reference potential 
would be about j that of a GaAs or AlAs reference po- 
tential. This is certainly an improvement, but since the 
quadratic response is already quite small (see Sec. lIIIBl . 
it is unlikely to make much practical difference. 

In the momentum representation, the Dyson equation 
(13.11) has the form 



-fc 2 ^(k) + E v ( k > k 'M k ') = Ei/j(k). 



(3.16) 



Making use of the translation symmetry of the reference 
crystal, one can write the Fourier transforms of the po- 
tentials 1)3.1511 as 



Ai£(k,k') = e- l(k - k,) - R "Aw Q (k, k'), 
A«g^,(k,k / ) = e - i < k - k '>- R «A« ' a ' R "(k ) k') 



(3.17) 



in which R" = R' — R, and the coordinate origin is 
R Q for the modified functions on the right-hand side. 
The Fourier transform of the linear response l|3.13|l can 
therefore be written as 

F (1) (k,k') = A^V(k-k')Av"(k,k'). (3.I8) 



Here N = ri / S7q is the number of unit cells (of volume 
fio) in the reference crystal (of volume f2), 9 a (k) is the 
Fourier transform of # R : 



^r= E ^(k)e 4k R % 
keo* 

w = ^ E 



(3.19a) 
(3.19b) 



Ren 



and 17q = (27r) 3 /fio is the volume of a unit cell in the 
reciprocal lattice. Note from (J3.19b[) that 6* Q (k) is quasi- 
periodic: 



8 a (k + G) = 6> Q (k)e 



(3.20) 



where G is a reciprocal lattice vector of the reference 
crystal. 

In a similar fashion, inserting (|3.17|) into the Fourier 
transform of (|3.14l) and relabeling R' gives 

T^ 2 )(k, k') = N E' ^ aQ ' R '(k-k')Au a,a ' R '(k,k / ) ! 



R' 



(3.21) 

in which 6 aa R (k) is the Fourier transform of the pair 
distribution function 6» QQ ' R '(R) = R R ' +R ,. 



B. Estimation of magnitude 

A crude estimate of the relative magnitudes of the lin- 
ear and quadratic response can be obtained from a simple 
nonlinear Thomas- Fermi model, 89 with the resulli 1 ^ 



~ AV, 



AV 

4e F ' 



H—, (3-22) 



where AV is the typical band offset defined in Sec.|Hl and 
£f is the Fermi energy. Now the difference in screened 
pseudopotentials between typical III-V semiconductors is 
roughly AV" ~ 0.02-0.05 Ry, 145 which yields the estimate 
|V (2) /V (1) | ~ 0.01. This suggests that the quadratic re- 
sponse is indeed very small. 
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This estimate is supported by the LDA calculations 
of Wang and Zunger — 2 for GaAs/AlAs heterostructures, 
in which they found (see Table ii2£) that the inter- 
face band-mixing terms arising from the linear response 
were on average about 1000 times larger than those 
arising from the quadratic response (with the ratio be- 
tween the smallest linear and largest quadratic terms be- 
ing about 100, in agreement with the estimate obtained 
above). However, the LDA calculations fo r Ga As/ AlAs 
and Ino.53Gao.47As/InP presented in Refs. Ill j and Ill6l 
indicate that the quadratic density response is only about 
10 times smaller than the linear density response (the 
linear and quadratic potentials were not given in these 
papers). 

Nevertheless, this is still a sufficiently large ratio to es- 
tablish the validity of the quadratic approximation used 
here. For the purposes of the perturbation scheme of Sec. 
HTl the factor 4cf in the denominator of Eq. (|3.22H will 
be treated formally as of order E g , so that AV^ is con- 
sidered to be of the same order [(AV) 2 /E g ~ [ka) 2 AV] 
as the smallest terms retained in Sec. HH 



Eq. (|3.24|) are octopole and hexadecapole potentials pro- 
portional to q x q y q z /q 2 and (g£ + + q^)/q 2 i^ 
The quadratic potential has a similar form: 

Ai, RR ,(k + G, k' + G') = w%Si, (k, k'; G, G') 

+ A GG ,(k,k') ¥ > R < R ,(k-k'), (3.26) 

in which A is the same vertex function as above. In this 
case, however, the potential w RR / (k, k'; G, G') is not an 
analytic function of k and k'. Nevertheless, to within 
the accuracy required here (namely, zeroth order in k and 
k'), the nonanalytic part can be neglected for neutral per- 
turbations. Note that the analytic part of the quadratic 
potential is analytic over a smaller region in k space than 
the linear potential. Whereas i« R (k, k'; G, G') was an- 
alytic for k and k' inside the first Brillouin zone, the 
analytic part of id^, (k, k'; G, G') is analytic only over 
the inner "half" of the Brillouin zone (i.e., over the Bril- 
louin zone of a crystal whose lattice constants are double 
those of the reference crystal) . 

The quadratic screened potential is defined by 



Functional form 



As shown in Ref. 124, the Coulomb interaction gives 
rise to singularities in the linear and quadratic potentials 
Av(k, k') when k — k' is equal to a reciprocal lattice 
vector. The explicit form of the linear potential isi24 

Av R (k + G, k' + G') = tu R (k, k'; G, G') 

+ A GG ,(k,k> R (k-k'), (3.23) 

where the potential w^(k, k'; G, G') is an analytic func- 
tion of k and k' for wave vectors inside the first Brillouin 
zone of the reference crystal. For small values of k and k' 
it can be represented as a Taylor series, which is the basis 
for the effective-mass theory developed in the following 
section. The effective vertex function A GG '(k, k') is also 
an analytic function of k and k'. 

The singular contributions come from the screened po- 
tential ip, which is a spin scalar of the forr 



,124,146 



^(q) 



Mq)»R.(q) 
*(q) : 



(3.24) 



where e(q) is the static electronic dielectric function of 
the reference crystal and 



^RR<(q) 



^(q)ng° R ,(q) 
e(q) 



(3.27) 



in which the effective density n RR , (q) is likewise not an- 
alytic, but can be approximated as such. The leading 
terms here are dipole and quadrupole potentials (since 
the monopole term vanishes for an insulator at zero tem- 
perature). In zinc-blende crystals, only the interface 
dipole term is non-negligible under the approximation 
scheme used in this paper. 

For use in Eqs. I|3.18[) and l|3.21|) . one requires also ex- 
pressions for the modified linear and quadratic potentials 
defined in Eq. (|3.17|) . These are given by 

Av"(k + G,k' + G') = w GG ,(k,k') 

+ e i(G - G,) ^A GG ,(k,k> Q (k _ k '), (3.28a) 



Av c 



i'R' 



(k + G, k' + G') = w G Q G f- (k, k') 

i(6-G'}.T„ Agg , ( k) k'^o/R' (k - k'), (3.28b) 



in which the terms on the right-hand side are defined by 
expressions similar to (|3.17|l . 



« c (q) 



Air/q 2 if q ^ 0, 
if q = 0. 



(3.25) 



IV. ENVELOPE-FUNCTION EQUATIONS 



The function n R (q) is an effective electron density con- 
taining partial contributions from the bare ionic pseu- 
docharge and the screening charge. This is an analytic 
function of q (for small q) with the symmetry of site 
R Q in the reference crystal. For isovalent substitutions 
in zinc-blende crystals, the leading nonanalytic terms in 



In this section, the Dyson equation (|3.1()|) for a het- 
erostructure is written in the Luttinger-Kohn represen- 
tationjiSI in which the basis functions are defined to be 
plane waves multiplied by the zone-center Bloch func- 
tions U n of the periodic reference crystali^Z^ Here the 
set {U n } is chosen to be a complete set of solutions to 
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the equation [cf. Eq. (|3 -IB ] 

-±V 2 t/„(x,w) + J ^ (r ' 0) (x,x',w)f/„(x',cj)dV 

= E n {cj)U n {x,uj) (4.1) 

that satisfy the periodic boundary conditions 

C/„(x,w) = l/ n (x + R,w), (4.2) 

in which l/( r '°) is the hermitian part of the reference po- 
tential V(°). For a general operator A, the "real" (hermi- 
tian) and "imaginary" (antihermitian) parts are defined 
here as 

A« = -(A* + A), A® = -(A* - A). (4.3) 

The solutions to Eq. 1)4. Q form a complete orthonormal 
set of periodic functions 127 for any value of u>. In the 
Fourier series representation 



U n (x,uj) = ^ U n G(u)e' 



G 



(4.4) 



G 



the orthogonality and completeness relations are 

Ul G {u)U n , G {uj) = <W, (4.5a) 

G 

Y,UnG(Lo)Ui G ,{uj) = l6 GG i, (4.5b) 

n 

in which U nG denotes the hermitian conjugate of the 
spinor U nG . 

Note that the Bravais lattice chosen for the periodic 
boundary conditions in Eq. (|4.2() need not be the same 
as that in (|3.12() : for certain applications it may be 
preferable to impose periodicity with respect to some 
(mathematically defined) supercell instead. For exam- 
ple, in treating intervalley T—X coupling in semiconduc- 
tors with the zinc-blende structure, it is convenien1j£2i£i 
to choose a nonprimitive simple cubic unit cell of vol- 
ume flo = a 3 (where a is the conventional cubic lattice 
constant), which encompasses four primitive fee cells^ 4 ^ 
This folds the X valleys onto the Brillouin zone center of 
the supercell, thereby permitting intervalley T—X mixing 
to be described in the same notation as that for ordinary 
r states (although the T and X states are of course not 
coupled by the k ■ p interaction). 

The wave function in the Luttinger-Kohn representa- 
tion (or envelope function) ^(k) is defined as 

F n (k)=^C/+ G ^(k+G), (4.6a) 

G 

^(k + G) =^F„(k)C/„ G . (4.6b) 



This definition is valid for any value of k, but since only 
those values from one unit cell fig are needed to de- 
termine ^(x), it is convenient to set F n (k) = when 



k £ fig. 49 i 148 The Fourier transform of (|4~6b"|l is then the 
usual exact envelope-function expansion*!* 1 ^ 



V-(x) =£> n (x)tf n (x). 



(4.7) 



The Dyson equation in the Luttinger-Kohn represen- 
tation is given by Eqs. (|3.16ll . (14. 5[) . and l|4.6|l as 

£;„F„(k)+^i mi ,(k) J F 1 n '(k) 

n' 

+ Y, E &V nn ,(k,k')F n ,{k') = EF n (k). (4.8) 

n' k' GHq 

Here AV^ = V — is the perturbation due to the 
heterostructure, the matrix elements of which are 

AVw(k,k')= J2 ul G AV(k + G,k' + G')U n , G ,. 

G.G' 

(4.9) 

The term L nn i (k) groups together all contributions from 
the bulk reference crystal Hamiltonian except E n \ i.e., 



Ln*{k) =V^,(k)-V^r'(0)+k-p nn , + ^d nnl , (4.10) 



r(r,0) 



in which V^°3 (k) = V^, (k, k) is the potential energy of 
the reference crystal and 

p n „, =Y,GUl G U n , G (4.11) 

G 

is the momentum matrix of the reference crystal. 

Within quadratic response theory, the perturbation 
Ay„„'(k, k') is obtained by substituting Eqs. I|3.18[) . 
and iglSEl) into Eq. g2J. The result is 

AV nn , (k, k') = W nn , (k, k') + A w (k, k>(k - k'), 

(4.12) 

where the vertex function 



r(0) 



A n „,(k,k') = ul G A GG ,(k,k')U n , G , 



(4.13) 



G.G' 



is again an analytic function of k and k'. The W term is 
defined by W = W (1) + W (2 \ where 

w£>,(k,k') = £V( k - k')W„ Q n ,(k,k'), (4.14) 



W% n ,(k,k') = N £ Ul G w GG ,(k,k>)U n , G ,e^ G '~ G >^. 

G,G' 

(4.15) 

The screened potential tp is defined by 

u c (q)n(q) 



v(q) = 



* q 



(4.16) 



where n = + is an effective electron density for 
the heterostructure perturbation: 



(q)=iV]TV(qK(q). 



(4.17) 



9 



The quadratic contributions and are given by 

obvious generalizations [see Eq. 1)3.28)) ] of the above ex- 
pressions. The physical interpretation of these results is 
considered below. 



V. POWER SERIES EXPANSIONS 

In this section, power series expansions are used 
to obtain approximate expressions for the Hamilto- 
nian matrix elements in the envelope-function equa- 
tions 1)4.8)1 . This approximation is justified by the 
existence£2*£i*i2Li22iii22ii2L of slowly varying envelope 
functions F n (x), for which F„(k) is negligible unless k 
is small. This expansion provides a starting point for 
the development of an approximate effective-mass the- 
ory, and also assists in the physical interpretation of the 
various terms in the Hamiltonian. 



for energies ui near the band gap of the reference crys- 
tal, the contributions from E^ 1 ' are much smaller than 
the smallest terms retained in the present approxima- 
tion scheme. Such contributions were therefore neglected 
in the above expressions, and are likewise neglected in 
subsequent analysis of the heterostructure perturbation. 
However, in any calculation where it is desired to include 
the effects of a finite quasiparticle lifetime, the dominant 
terms may be restored to leading order by replacing E n 
with 

E n (u) -> E n (u;) +<E^(k = 0,w), (5.3) 

and then retaining the imaginary part only to first order 
in perturbation theory in all subsequent analysis. 



B. Linear heterostructure potential 



A. Reference crystal Hamiltonian 

The leading terms in the bulk Hamiltonian L nn / (k) of 
Eq. (14. 1 Up are derived from a Taylor series expansion of 

(k) , and are given through terms of the fourth order 
in k by 



Iw(k) = k X TT 



A 

nn' 



k\k,jk K C f ^ 



k\ k^ k K k v 



K nn' 

XflKlS 



(5.1) 



Here a summation with respect to the Cartesian indices 
A, fi, k, and v is implicit, and the coefficients are 



Pnn' 



dk x J ' 

/ k=0 



XfJ,K 



Q 



nn' 



Sx^Snn' + 

3! ydkxdk^dk,, 
1 ( d*V^?\k) 



dk\dka 



k=0 



k=0 



(5.2) 



4! \ dk\dkndk K dk v 



k=0 



The various derivatives of V^',°^(k) account for contri- 
butions from the nonlocal part of the potential energy 
to the dispersion relation of the reference crystal. The 
quantity 7r^ n / is the kinetic momentum matrix of the ref- 
erence crystal, whereas the other terms give partial con- 
tributions (see Sec. IVII for the remaining contributions) 
to the effective-mass tensor and the cubic and quartic 
dispersion terms of the reference crystal. 

In principle, L nn > (k) should include contributions from 
the antihermitian part of the self-energy E^^(k, ui) — 
vSffi (k, bj). However, in Appendix 1X1 it is shown that, 



A similar Taylor series expansion technique is useful 
for the terms W nn i (k, k') and A rm /(k, k') in the hetero- 
structure perturbation 1)4. 12)1 . This subsection begins by 
considering the simple special case in which the screened 
atomic pseudopotentials w^ G , (k, k') in Eqs. (|3.28a|) and 
(|4.15|) have the form of a local potential; i.e., 



w 



GG 



,(k,k') = w Q (k-k' + G-G') 



(5.4) 



Such would be the case, for example, in an LDA cal- 
culation based on local ionic pseudopotentials. In this 
is also a local potential of the form 



case, W^,(k,k' 
Ki), 



W^J, (k, k') = w£>, (k - k'). This simplification makes it 
easier to grasp the physical significance of the power se- 
ries expansion, and also facilitates a comparison between 
the present theory and earlier envelope-function models 
based on local empirical pseudopotentials. 



Local analytic terms 



The approximation technique used here for Wf^j (k — 
k') is to expand the term W^ n , (k — k') on the right-hand 
side of Eq. (|4.14|) in a Taylor series in q = k — k': 



? Q (q)[lO+i<ZA(Z, 



A 

nn' 



(5.5) 

Here the terms W% n ,, (Z* n ,) a , and {Y^,) a are q-inde- 
pendent expansion coefficients. The series 1)5.50 has been 
truncated at the second order in q because such terms 
are of order (ka) 2 AV, which are the smallest corrections 
permitted in the present perturbation scheme. 

The result 1)5.5)1 may be used directly in a k-space 
envelope- function calculation based on Eq. 1)4. 8)1 . in 
which the k values are expressly limited to the unit cell 
£7q- However, to obtain a local differential equation, one 
must allow k to range over all possible values. Since 
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6 a (k) is quasi-periodic [see Eq. (|3.20(l ]. this local approx- 
imation will generate large-k terms in the envelope func- 
tions unless a k-space cutoff is introduced. If this is done, 
the Fourier transform of 15.511 is the local potential 

(5.6) 

in which d\ — d/dx\ and [cf. Eq. I|3.19a|) ] 

<T(x) = ^5(k)(9 Q (k)e jkx , (5.7) 

k 

where the cutoff function B{k) is defined in Appendix iBl 
From Eq. I|3.8|) . the constraint 

]T^(x)=0 (5.8) 

a 

is satisfied for any choice of B(k). 

The physical significance of the result l|5.6|l can be ap- 
preciated by considering a specific example such as a 
(001) GaAs/AlAs hetcrojunction. In this will 
be shown in Sec. IVIll the function ^"(x) depends only 
on the z coordinate and behaves like a smooth step func- 
tion at the interface. The spatial derivatives in Eq. (|5.6|) 
therefore generate finite-width <5-like terms at the inter- 
face, with the Z term proportional to 8{z) and the Y 
term proportional to 6'(z). Hence, Eq. I|5.6[) provides a 
first example of the interface band-mixing terms alluded 
to previously in Eqs. I|2.5|) and Ij2.6|l . 

The physical origin of these terms can be understood 
by going back one step further in the derivation. From 
Eq. (|4.15(l . it is clear that the Taylor series expansion 
of W" n , (q) in Eq. H5.5(l is equivalent to a Taylor series 
expansion of the screened atomic pseudopotential (|5.4|) 
with respect to q = k — k'. Hence, the physical origin of 
the linear and quadratic (in q) terms in Eq. I|5.5I) is sim- 
ply the finite slope and curvature of the screened atomic 
pseudopotentials in momentum space. 

This demonstrates that the Hamiltonian of a hetero- 
structure depends not just on the values of the atomic 
pseudopotentials w a (k) at the reciprocal lattice vectors 
G, but also at a range of k values in a finite neighbor- 
hood of each G (the size of the neighborhood depending 
on how rapidly varying the envelope function is). The 
necessity for an accurate fitting of empirical pseudopo- 
tentials over a range of k values has been emphasized 
particularly in the work of Mader and Zunger>M& The 
truncated expansion (|5.5|l shows that the present pertur- 
bation scheme relies for its accuracy upon the validity 
of a quadratic extrapolation of u> Q (k) in the neighbor- 
hood of each G. Inspection of the form of typical atomic 
pseudopotentialsi^^iiSl shows this to be a good approxi- 
mation. 

The definition l|5.5[l of the interface band-mixing pa- 
rameters Z and Y highlights a significant difference be- 
tween the present approach and previous derivations of 
envelope-function Hamiltonians. In previous derivations, 



the heterostructure potential was chosen to have the form 

K(x) = £^(x)0<(x), (5.9) 

in which V (x) is the periodic potential for the bulk ma- 
terial I, and #'(x) is a form factor determining the com- 
position of the heterostructure. Within this model, V(x) 
depends only upon the atomic pseudopotentials at the 
reciprocal lattice vectors G, with the magnitude of the 
interface (5-like terms determined by 6>'(x). In the limit 
of slowly varying #'(x), the interface terms vanish. 

The present results show that this behavior is an un- 
physical artifact of the model l|5.9[l . In the present the- 
ory, the strength of the interface terms is the same (in 
the virtual crystal approximation) for slowly graded and 
abrupt heterostructures, with the rate of change of # a (x) 
affecting only the width of the interface terms. 



2. Nonlocal analytic terms 

The more general case of a nonlocal potential involves 
only a straightforward extension of these results. The 
matrix element W" n , (k,k') of Eq. 14.15fl is expanded to 
second order in k and k' as follows: 

wc,(k, k') = w: n , + fc A (J„vr + k' x i(j* jt 

+ kxk' fl (R%r- (5-10) 

Here the antihermitian part of the self-energy was ne- 
glected, as discussed above Eq. <|5.3[1 and in Appendix lAl 
This approximation simplifies the expansion by provid- 
ing a relationship between (for example) the coefficients 
of k\ and k' x . The physical interpretation of the various 
expansion coefficients is discussed below (in Sec. IVlf) . af- 
ter perturbation theory has been used to eliminate the 
coupling to remote bands. 

When this expansion is substituted into Eq. (|4.14l) for 
W^!(k,k'), the result can be written as 

(k,k') = W^,(q) + fc A J r i,(q) + fc A [J„ A ,„(-q)]* 
+ fcAfc^C(q) + fc A fc;[AC A „(-q)]* 

+ *A*^£/(q), (5-11) 

in which q = k — k' as before, and the various q-de- 
pendent functions are defined by expressions of the form 

a 

These functions all have a step- function-like behavior in x 
space at a hetero junction (see Sec. lVIIf) . Since 6^ is real, 
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they also have the hermiticity and symmetry properties 
[W^(q)}*=W^ n (-q), 

M^(q)=M„^(q). 



3. Nonanalytic terms 

The term A n „' (k, k')<^W (q) describing the nonanalytic 
contributions to Eq. I|4.12(l is handled in much the same 
way. The vertex function A„„'(k, k') is expanded in a 
Taylor series of the form l|5.10[l , while the linear electron 
density in Eq. (|4.17(l is expanded as 

«"(q) = q\nl + qxq^n^ + q\q^,q K n^ K + qxq^q^nl^, 

(5.14) 

which is valid for neutral perturbations. In a heterova- 
lent zinc-blende heterostructure described by neutral 
perturbationsji2ii2& the dipole and traceless quadrupole 
terms and — ^n„ u 8\^ are nonvanishing only at 
interfaces. For isovalent substitutions in zinc-blende, 
n" = and n"^ = ^n^Sx^ everywhere, so the contribu- 
tion from the latter term is analytic and can be absorbed 
into the definition of the analytic potential (|5.1()(l . The 
remaining terms and ro" MK „ describe octopole and 

hexadecapole moments^Sa 

The inverse dielectric function e _1 (q) in Eq. (|4.16J) can 
also be expanded in a power series. For isovalent per- 
turbations in zinc-blende this has no qualitative signif- 
icance, since the leading correction merely renormalizes 
the hexadecapole term n" AtKJ/ ii24 However, the wave vec- 
tor dependence of e _1 (q) does generate a qualitatively 
new contribution from the i nterf ace dipole term in Eq. 
(JEHU); see Eq. (6.23) of Ref.IHifor the explicit form of 
this term. 

With the expansion (|5.14|) . the effective density (|4.17|) 
has the same form as that derived above for the local 
potential in Eq. (|5.5|) . In coordinate space, it involves a 
series of derivatives of 9 a (x) , similar to the result shown 
in Eq. I|5.6[l . This is the same as the usual multipole 
expansion of the macroscopic charge density in classi- 
cal electromagnetismiASi In particular, note that for a 
perturbation consisting of a single impurity atom, the 
function # Q (x) is just the macroscopic average of a Dirac 
5 function, i n com plete agreement with the expressions 
given in Ref . Il5ll 

Since the Taylor series expansion for A„ n <(k, k') is 
identical in form to that for W" n , (k, k'), the contribu- 
tion from this term will not be written out explicitly here. 
All subsequent perturbation theory analysis for the two 
terms is formally identical, except that the vertex func- 
tion is multiplied by an extra factor of <^(q). 



C. Quadratic heterostructure potential 

1. Analytic terms 

Because the quadratic response is already of order 
{AV) 2 /E g , the quadratic version of Eq. (|4.14|) can be 
replaced by the zeroth-order approximation 

W,[ 2 J,(k,k') = J2' ^ a ' R '(q)^' R ' 

a, a 7 ,R/ 

^W^(q), (5.15) 

in which 

WZ' R ' = t lim WZ R '(K k'), (5.16) 

where the limit is well defined for neutral perturba- 
tionsii24 This contributes a local potential-energy term 
similar to that given by W^,(q) in Eq. QS.llfl . At a 
hetcrojunction, the functional dependence in x space is 
similar to that of a smooth (macroscopically averaged) 
step function, with possible deviations in the vicinity of 
the junction. 

Now since W"3 R depends only on the part of 
W""i R (k, k') that is analytic in k and k', it is a short- 
range quantity that is significant only when R' + T a < — T a 
is comparable to the lattice constant a. Therefore, at 
a heterojunction, replacing VF„ n /(q) with an ideal step 
function would generate an error of order ka in a term of 
order (ka) 2 AV. Hence, the error is of the same order as 
the local approximation l|2.8[) and can be neglected. 

Thus, within the present perturbation scheme there 

~ (2) 

is no interface contribution from Wj n /(q). This is an 
important result, as it simplifies the analysis of interface 
effects in later sections of this paper. 

Within the context of an empirical pseudopotential 
modelfi^ the nonlinear bulk term Wj^, (q) derived here 
can be represented as an environment dependence of the 
screened empirical pseudopotential, in which (for exam- 
ple) the pseudopotential for an As atom in GaAs is differ- 
ent from that for an As atom in AlAs. The importance 
of accounting for such effects has been emphasized by 
Mader and Zunger>i^ 

2. Nonanalytic terms 

The leading terms in the quadratic density are the 
dipole and quadrupole terms 

n w — q^ n \ +q>~q^ n \n > l 5 - 1 ') 

which again is valid for neutral perturbations. (Here the 
constant term vanishes even for charged perturbations as 
long as the system is insulatingji24 but in this case there 
is an additional nonanalytic term of order q 2 ^24) In zinc- 
blende materials, the net contributions to (q) from 
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n™ R and the traceless part of n™" R both vanish in 
the bulk regions of a heterostructure. The interface part 
of n"^ R is negligible under the current approximation 
scheme, while the bulk part can be absorbed into the 

~ (2) 

definition of the analytic potential W^,(q). Therefore, 
only the interface dipole term remains under the current 
approximation scheme. 

D. Two-dimensional systems 

In a heterostructure (such as a quantum well) with 
two-dimensional translation symmetry, the nonanalytic 
terms arising from <p(q) have a particularly simple form. 
Let the dimensionless coordinates Xi and kj be defined 
by x = XiB.i and k = kjbj, where aj and bj are basis 
vectors for the direct and reciprocal lattices of the refer- 
ence crystal, with • hj = 2%Sij. In these coordinates, 
the lattice sites are defined by Xi — Ri and kj = Gj, 
where Ri and Gj are integers. In such a two-dimensional 
system, one can choose ai and a.2 to lie parallel to the 
junction plane, so that the atomic distribution function 
9^ is independent of R\ and i?2- The Fourier transform 
6 a (k) then has the form 

r(k)=0 a (fc 3 )$> k||G|| e- <G H^, (5.18) 

where k|| = fcibi+fc2b2. For small k||, the k|| dependence 
is simply JkyO- The same conclusion holds for the pair 

distribution function aa R (k) . 

Therefore, the nonanalytic potential l|4.16[) has the 
form ip(q) = <p(q3)Sq ]] o, which is independent of the di- 
rection of q. As a result, all terms in <p(q) except the 
monopole and dipole terms reduce to analytic functions 
of <?3 , which can be absorbed into the definition of the an- 
alytic potential W. In regard to the monopole and dipole 
terms, this paper considers only neutral perturbations in 
crystals with no bulk dipole moment. Thus, the only 
nonanalytic contributions are the 1/^3 terms generated 
by the interface dipoles in Eqs. I|5.14|l and l|5.17|l . 

These have the same dependence as the Fourier 
transform of a step function, and merely add extra terms 
to the band offsets at a heterojunction. Hence, the inter- 
face dipole contributions can be absorbed into the defini- 
tion of W^> (q) an d Wj®, (q). Note that in a no-common- 
atom system such as InAs/GaSb, the contribution from 
the quadratic interface dipole to the band offset has a 
different value for GaAs-like and InSb-like interfacesii 5 ^ 



In summary, for the material systems considered in 
this paper, the nonanalytic potential tp(q) does not con- 
tribute anything qualitatively new in a heterostructure 
with two-dimensional translation symmetry. Only the 
interface dipole term is truly nonanalytic, and that can 
be absorbed into the definition of the band offsets (al- 
though this contribution does depend on the microscopic 
structure of the interfac e 152 ). Long-range potentials aris- 
ing from the direction dependence of </?(q) appear only 
in structures with lower translation symmetry such as 
quantum wires and dots. 



VI. ELIMINATION OF INTERBAND 
COUPLING 

In this section, perturbation theor y 31 i 34 i 127 is used to 
derive a multi-band effective-mass Hamiltonian from the 
infinite-dimensional matrix equations l|4.8[) . The method 
is outlined briefly here; for further details see Refs. l34l 
andlM 

The zone-center states n of the reference crystal are di- 
vided into a class A containing the states of interest, and 
a class B containing all other states. The total Hamil- 
tonian is written as H = Hq + H', where Hq has ma- 
trix elements (-ffo)mm' = E m 5 mm i, and m = (n, k) is a 
composite index. The unperturbed Hamiltonian Hq is 
assumed to be hermitian, but the perturbation H' need 
not be. A similarity transformation H = e~ s He s is used 
to eliminate the coupling between sets A and B to any de- 
sired order in the perturbation H' . (The transformation 
is unitary if H' is hermitian, as is approximately the case 
here.) This yields a finite-dimensional effective Hamilto- 
nian for states m, m' G A, which is given explicitly (to 
third order in H') in Eq. ljCl|) of Appendix [U] 

For the case considered here, the matrix elements of 
H and H' are given by Eq. (|4.8|) as (nk\H \n'k') = 
E n 5 nn >5 kk > and 

(nk\H'\n'k') = L„ n ,(k)5 kk , + ^»(k, k') + W^(q) 

+ A w (k,k>(q), (6.1) 

in which L„„/(k) is defined in Eq. 15. ID . W^^k, k') in 

Eq. (|5.11|) . and W^,(q) in Eq. (|5.15l) . Upon inserting 
these matrix elements into Eq. IjClll . one obtains the 
effective-mass Hamiltonian (for n, n' G A) 



(nk\H\n'k') = {E n S nn/ + k x ^ n , + k x k^D^, + k x k^k K C^ + k x k^k K k u Q^r)6 k w + W^ t {q) + W^,(q) 
+ k X J^(q) + k' x [J* n (-q)T + k x k,M^,(q) + k' x k^M^ n (-q)V + k^R^q) 

+ [A nn , + k x J* n , + k' x (J^ n )* + kxk„M^ t + k' x kl(M^ n )* + k x k'^R x Z,Mq). (6.2) 
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In this expression, ir* n , is the kinetic momentum matrix 
(|5.2|1 of the reference crystal, 2D J^, is the inverse effective 
mass tensor of the reference crystal [see Eq. (|C2|) ] , and 
C^f and Qjrf" are the coefficients of the cubic and 
quartic dispersion terms in the reference crystal [see Eqs. 
(|C3J| and i|C4|l ]. These are just renormalized versions of 
the quantities D%„ C^, and Qffi defined previously 
in Eq. E3- 

Likewise, the functions wj^,(q), J* n ,(q), M^,(q), 

and i?^/(q) are all renormalized versions of quantities 
defined previously. The renormalized functions are given 



explicitly in Appendix IU1 

The terms multiplying the screened nonanalytic poten- 
tial ip(q) are derived from the Taylor series expansion and 
k • 7r renormalization of the vertex function A rm /(k, k'). 
The various constant coefficients (e.g., R n ^,) are defined 
in the same way as the analogous q-dcpcndcnt functions 
[e.g., -R^/(q)] given in Appendix but with Wj^,(q) 
replaced by the constant A„„/ = A„„/(0, 0). 

Equation H6.2(l can now be rearranged^d^ and Fourier 
transformed to obtain the effective-mass Hamiltonian 



H nn ,(x,p) = E n 5 nn , + {p A) 7T* n ,(x)} + {p A ^,^< } «} +PXP^PnC^ + PxP^p v QlT + + W$(x) 

+ 9 A ^(x) + d x dX%(x) + ^r^,(x)} + [d^WK 

+ A w </?(x) + TT^ n ,{ Px , ip(x)} + D2£ } {pxPn, v(x)} 

+ Z^dxrtx) + Y^,d x dM*) + f »£{pa, «x)} + [9^(x)]pa, (6.3) 



in which {A, £} = {AS} = |(AB + BA) is the sym- 
metrized product, p = — iV is the momentum operator, 
and d\ — d/dx\ acts only on the function immediately 
to its right. 

All of the x-dependent functions are defined in terms 
of the k-space cutoff (15.711 . The quantity 

*nn' (x) - vr^, + J„ A „, (x) + [j£, n (x)]* (6.4) 

is the material-dependent kinetic momentum matrix for 
the heterostructure, while 

^W=^+^(x) + [M^„(x)]*+^,(x) (6.5) 

is half the inverse effective mass tensor for the hetero- 
structure. This has symmetric and antisymmetric parts: 

^ } (x) = i[^(x)+^,(x)], 

* ] (x) = |[^(x)- J D^(x)], 

although the antisymmetric part has a nonvanishing con- 
tribution only in the presence of a magnetic field. 

The functions Z, Y, T, and $ in Eq. I|6.3|) are all in- 
terface terms. The first two terms 

Z£ n ,(x) = -*§(J* n ,(x) - [J„ A , n (x)]*), (6.7) 

= (6-8) 

are renormalized versions of the <5 and 5' mixing poten- 
tials considered previously in Eqs. (|5.5() and l|5.6|l . How- 
ever, the other two terms 

r^,(x) = -i(M$*>(x) - [M^>(x)]*), (6.9) 
^( x )=^(x), (6.10) 



were not present in Eqs. 1|5.5[) and l|5.6|l . The term 
$^,(x), which is antisymmetric in A and /i, is just a 
generalized Rashba coefficient 31 i 153 i 154 i 155 i 156 for multi- 
band Hamiltonians. However, the term T ^,(x), which 
is symmetric in A and has received little attention 

in the literature. Its physical interpretation will be dis- 
cussed below in Sec. IVIIII 

Since the functions Z, Y, T, and <& behave to lowest 
order as step functions at an abrupt junction, the Hamil- 
tonian H6.3fl shows explicitly that these functions produce 
interface terms proportional to 6 or 6'. 

The remaining terms (it, Z, etc.) in Eq. (|6.3|l that ap- 
pear in front of y(x) are defined by the obvious gen- 
eralizations of Eqs. H6.4fl - H6.10fl . The interpretation of 
these terms parallels that of the terms already discussed, 
except that the contributions from ip are not as well lo- 
calized at the interface. 

Here it is worth noting that in the GW approxima- 
tionf 94 i 95 one nag A nn , = s nn , and 7T A n , = D n £, = 0, so 
the potential <p(x) does not contribute to the momentum 
matrix or the effective-mass tensor. In LDA, these simpli- 
fications are also valid, and one has in addition Z^ n , = 0, 
since the exchange-correlation potential is short-ranged. 

For isovalent zinc-blende systems, the leading term in 
is an octopole potential, so the contributions from 
the second-rank tensors D, Y, T, and $ are negligible. 
These terms are non-negligible only for the linear inter- 
face dipole term in a heterovalent zinc-blende system (or 
for a slowly varying external potential, which is not con- 
sidered explicitly here). Likewise, is negligible in all 
terms except A nn i. 

Equation (|6.3|) is the main result obtained in this 
paper. The qualitative form of this Hamiltonian is 
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very similar to the Leibler Hamiltonian^LiSi for slowly 
graded heterostructures, as amended by Takhtamirov 
and Volkovi££'££*££ The differences are primarily due to 
the use of atomic pseudopotentials (rather than a model 
based on periodic bulk potentials), the inclusion of long- 
range Coulomb potentials, and the use of linear response 
theory to simplify the interface Hamiltonian, as discussed 
in Sec. E| 

The explicit form of the various material parameters in 
Eq. H6.3JI for semiconductors with the zinc-blende struc- 
ture is given below in Sec. IVIlil First, however, the pos- 
sibility of representing the material parameters as piece- 
wise constant is considered. 



VII. SIMPLIFIED MATHEMATICAL 
REPRESENTATION OF HETEROSTRUCTURE 
MATERIAL PROPERTIES 

This section discusses several ways in which the math- 
ematical description of material properties can be sim- 
plified. The first is to label the materials in terms of 
bulk compounds (e.g., GaAs) rather than atoms; the sec- 
ond is to approximate the properties of an abrupt junc- 
tion using piecewise constant material parameters with 
5 functions and their derivatives at interfaces. For sim- 
plicity, only two-dimensional isovalent systems are con- 
sidered here (see Sec. lVDj l. 



A. Transformation from atomic to bulk-crystal 
description 

All of the linear-response terms can be transformed 
immediately to a bulk-crystal representation similar to 
that described above in Eq. (|5.9|l : 

W$(*)= £V(x)W^ (7.1a) 

a 

= ^V(xX„,, (7.1b) 



in which I labels the different bulk materials, and [cf. Eq. 

E3] 



5>'(*) = o. 



(7.1c) 



For example, in a GaAs/AlAs heterostructure, the linear 
potentials for the two bulk media are defined by 



Ti^GaAs _ iJ/Ga , tttAs 

nn nn 1 nn ' 

j^/AIAs = w Al , w As 

* ' 71.71.' * ' 71.71.' 1 ' ' 71.77.' ' 



(7.2) 



If the reference crystal is chosen to be GaAs, then W G 3 = 
W£~n' = 0. [This choice is made for clarity of exposition; 
the final results given in Eqs. I|7.4|l . I|7.9|l . I|7.14|l . and 



(|7.16|l do not depend on the choice of reference crystal.] 
The transformation from (|7.1a|) to (|7.1b|) is then simply 



W (1) ,(x) 

nn' \ J 



rA] 



Alt 



^AlAs 
' nn' 



Hence 



nAlAs 



iAI 



oGaAs 



qGa 



(7.3) 



(7.4) 



For a no-common-atom system such as InAs/GaSb, 
four different bulk potentials can be defined: 



(7.5) 



If the reference crystal is chosen to be InAs, then W^, = 
W A *, = 0. In the atomic description, the linear response 
potential is 



Ti/InAs 
r r nn f 


= w ln , - 

r r nn 


f w As , 


TT/InSb 
r r nn' 


= w ln , - 

r r nn 


f W Sh , 


TrrGaAs 
r r nn' 


= W Ga , - 

r r nn 


f W As , 


TT/GaSb 

nn 


= W Ga , - 

nn 


f W Sh , 



Sb 
nn' i 



(7.6) 



which can be rewritten (using 8 ln + 9 Ga — and 9 As 
6> sb = 0) in several different ways, two of which are 



W. 



(i) 



e Ga w^P h + (6» In + e Sh )w^ b 



and 



(i) 



6 Sh W^? h + (9 Ga + e As )w Ga , 



GaAs 



(7.7a) 



(7.7b) 



Equation (|7.7a() is useful for describing a (001) hetero- 
junction with an InSb-like interface: 



As-In-As-In-Sb-Ga-Sb-Ga-- • • 



(7.8a) 



while Eq. I|7.7b|) is useful for describing a heterojunction 
with a GaAs-like interface: 

• • • -In-As-In-As-Ga-Sb-Ga-Sb- • • • . (7.8b) 

In the first case one can identify the bulk functions 



^GaSb 
/llnSb 



9 Ga , 

nln i 



3 Sb 



while in the second case 

gG&Sb _ QSb 
/iGaAs /]Ga , nAs 



qIuAs 
qGaAs 



qlnAs 
qlnSb 



= 0, 



= 0. 



(7.9a) 



(7.9b) 



This type of transformation can be used for any term in 
the linear response. For the quadratic response, such 
a description is not appropriate, but all contributions 
(including the interface dipole potential) can be approx- 
imated as abrupt step functions, as discussed in Sees. 
lyTHl andlVDl 
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B. Piecewise constant material parameters 

The next step in simplifying the description of the ma- 
terial properties is to approximate an ideal heterostruc- 
ture as piecewise constant. As a specific example, the 
case of a (001) heterojunction between semiconductors 
with the zinc-blende structure is considered here. 

For this case, a convenient slab- adapte d^ 8 '"— unit cell 
is defined by the basis vectors 



-(1 



-(1,1,0), H3 = -(1,0,1) 



ai = 77(1, -1,0), a 2 

(7.10) 

Periodic boundary conditions are applied over the crystal 
volume Q = Li • (L 2 x L3), where Lj = iV^a^ and Ni is 
an integer (thus fi = Nfl , where N = NiN 2 N 3 ). 

The bulk properties of the heterojunction are to be 
represented in terms of the periodic step function 



e(x) = e(z) = 



1, < z < \L Z 
0, -\L Z < z < 



(7.11) 



in which L z = ^N^a is the period in the z direction, 
and G(x) = 0(x + Lj). The interface properties are to 
be represented by the derivatives z (x) = dQ/dz and 
©zz(x) = d 2 Q/dz 2 , which are periodic arrays of 6 and 5' 
functions. 

The first example of an actual (001) heterojunction 
to be considered is a common- atom GaAs/AlAs junc- 
tion. The coordinate origin is chosen to be an inter- 
face As atom, with unit-cell basis vectors T a = and 
t c = ja(l, 1, 1) for anions and cations, respectively. The 
discrete function 9^ for a — Al is therefore 



9£ 



1, 0<i? 3 



hN* - 1 



0. 



rJV 3 < i?, < -1 



(7.12) 



where the dimensionless coordinate R3 is an integer (see 
Sec. IV JJjl . From the Fourier transform of this function, 
one finds that for small k, 



# A1 (k) * e(k) - ^e zz (k). 



(7.13) 



Upon inserting this result into Eq. J72J, one finds that 
for \z\ < \L Z , the linear band offset at a GaAs/AlAs 
heterojunction can be represented as 



AlAs 
nn' 



w 



GaAs 



)[®{z)-x] 



(Hi 



(W, 



AlAs 

nn' 



w 



GaAs 



)6'(z), (7.14) 



which has been written in a general form suitable for 
a virtual reference crystal (GaAs) (AlAs) x . Thus, if 
the band offset is treated as piecewise constant, there is 
an additional correction proportional to 5'(z) that has 
the effect of renormalizing the matrix Y in the Hamil- 
tonian (|6.3|l . This merely reflects the fact that 9 A1 (z) is 
a smooth (macroscopically averaged) step function, and 



the difference between a smooth step function and an 
abrupt step function is, to leading order, proportional to 
the macroscopic average of S'(z). 

The remaining material-dependent parameters in Eq. 
(|6.3() can be treated using the same approach, but for 
these parameters the term proportional to Z2 yields a 
correction of order (ka) 3 AV or higher, which can be ne- 
glected. Thus for an ideal GaAs/AlAs junction, all of 

the material parameters in i|tj.3fl except (x) can be 
replaced by abrupt step functions. 

For the case of a no-common-atom InAs/GaSb junc- 
tion, the situation is more complicated. The example 
considered here is the GaAs-like junction in Eq. (|7.8b|l . 
where the origin is the midpoint of an As-Ga bond and 
t c = —T a = gd(l, 1, 1). In this case, the small-fc behavior 
of the atomic distribution functions is 



^ a (k) ~ e(k) + -e,(k) - 3^©**(k), 
]e 2 (k)-^e zz( k). 



p b (k) ~ e(k) 



Upon inserting these results into Eq. 



(7.15) 



one finds 



that for \z\ < 
imated by 



;L Z , the linear band offset may be approx- 



GaSb 
nn' 



+ ^(2IU„ G „ a , As - W l ^ s - W^ h )5(z) 
2 



— (W 

384 1 ' 



%? b -W%F)6'(z), (7.16a) 



which is written in a form suitable for the virtual refer- 
ence crystal (InAs) 1 ^(GaSb)^.. Note that this has the 
form 

W<£,(x) = (w£}-w£)[Q(z)-x] 

-^(W$-W<rf)8'(z), (7.16b) 

in which (+) and (— ) label the bulk materials to the 
right and left of the junction, (i) labels the "interface" 
material, and the reference crystal is (+)i- x (— )x- This 
result is valid for any no-common-atom (001) junction, 
regardless of the convention chosen for the t vectors. 
Thus for a no-common-atom junction, WjQ (x) contains 
terms that renormalize both the Z and Y matrices. 

The 8{z) term in l|7.16|l has a very simple physical in- 
terpretation in which half a monolayer (a/4) at the in- 
terface is occupied by GaAs instead of either InAs or 
GaSb. This concept has been used previously in the 
construction of envelope-function models for electronsiSS 
and phononsiiSi 

Note that if one replaces Sb with As in Eq. I|7.16a|l . 
the result does not reduce directly to an expression of 
the form l|7.14|l . However, the difference is merely due to 
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the different choice of coordinate origin in the two cases. 
To leading order, Q(z — |a) ~ Q(z) — ^aS(z) + jjga 2 5'(z), 
S(z — |a) ~ 5(z) — ^aS'(z), and S'(z — |a) ~ S'(z). Thus, 
replacing Sb with As and shifting the origin by ^a(l, 1, 1) 
does indeed reduce l|7.16a|l to (|7.14|) . 

The result (|7.16b|) can be applied to all other linear- 
response terms in the Hamiltonian (|6.3|) . but since the 
A-7r(x) and Z(x.) terms are already of order (ka)AV, the 
S'(z) term in (|7.16b|l yields a negligible correction of order 
{kafAV. Thus 

Z* n ,M = (zW-Z^)[Q{z)-x] 

+ l(2Z^-Z^-Z*7 ) )5(z), (7.17) 

and we see that Z generates a correction to Y, while 
7r generates a correction to T. However, the position 
dependence of all remaining terms in l|6.3ll (including the 
quadratic response) can be represented as a simple step 
function: 

= 0C (+) - i$ (-) )[e(2) - (7-i8) 

since these terms are already of order (ka) 2 AV. 

VIII. THEORY OF INVARIANTS 

In this section the method of invariants22i2&2!iSi2& is 
used to construct the explicit form of the Hamiltonian 
()6.3(1 for r electrons in semiconductors with the zinc- 
blende struct ure»iSSiiSi The results are then compared 
with the interface Hamiltonian given by the method of in- 
variants for a common-atom (001) junction, including all 
(linear and nonlinear) interface terms of order (ka) 2 AV 
that are permitted by symmetry. Finally, the interface 
terms of order (ka)AV are considered for the case oiT-X 
coupling in GaAs/AlAs. The results in this section make 
use of the time reversal and crystal symme try p roperties 
of the self-energy that were derived in Ref. Il24i 

A. Generalized Leibler Hamiltonian 

As demonstrated above, to within terms of order 
(ka) 2 AV, the position dependence of the interface pa- 
rameters can be calculated entirely in terms of bulk-like 
matrix elements modulated by the atomic form factors 
# Q (x). Thus, the Hamiltonian for a zinc-blende het- 
erostructure of arbitrary composition can be determined 
by constructing invariants transforming as Ti under the 
symmetry operations of the group. This amounts to 
treating # Q (x) as a "slowly varying" function that is an 
invariant of T^m^ 

The relevant basis functions for the representations of 
Td are given in Table [H The specific example to be con- 
sidered here is that of the T 8 valence band, with T 15 and 
Tq derivable as special cases of the Tg results. The ex- 
tension of these results to multi-band Hamiltonians (e.g., 



TABLE I: Basis functions for constructing invariants of Td- 
Here P is a vector operator whose components need not com- 
mute, while cr, I, and J are angular momentum (pseudovec- 
tor) operators corresponding to angular momentum |, 1, and 
§, respectively. Cyclic permutations of x, y, and z also yield 
acceptable basis functions. 

Rep. Basis functions 

f*2 JxJyJz "t" JzJyJx 

p2 1_ p2 r2 r2 t2 1_ t2 

1 12 i x g r , l x 3 J ; " x 3 " 

Tl5 Px, {PxPy}, {Ixly}, {JxJy}, V x = {Jx(Jy — J z )} 

r 2 5 i[Px,Py], Cr x , Ix, Jx, Jx 



Tfi ® Tr ® Tg) can be handled using the methods of Refs. 
I3al3l3l but is not considered explicitly here. 

The momentum matrix 7r A (x) must transform as a vec- 
tor (ri 5 ) that is odd under time reversal, which leaves 
only one possibility: 

^ A (x) = Ytf (x), (8.1) 

in which V x = {J X (J 2 — J 2 )} is a product of angular 
momentum matrices J u for a particle with spin |, and 
£(x) is some linear combination of # z (x) functions. The 
corresponding results for Ti 5 and T 6 states are obtained 
by replacing J„ — * l v (spin 1) and J u — > a v (spin i), 
respectively. This yields zero in both cases, hence 7r A (x) 
occurs only for Tg and is a relativistic effect. 

For the effective- mass tensor Z? Am (x) one obtains the 
well-known result 

- £>^(x) = i 7 i(x)<5A M f - 72(x)MJ A 2 -\ j2 ) 
-7 3 (x)(1-<5a m ){JaJ m } 

+ ie AAl!/ [/t(x) J v + q(x)J% (8.2) 

in which the Luttinger parameters 7i(x), 72 (x), 73 (x), 
k(x), and g(x) are all linear combinations of 9 l (x) func- 
tions, and ex^v is the antisymmetric unit tensor. For 
Ti 5 the parameter q is not independent (since 1^ = 
whereas for Tq only 71 and k are independent. 

The Z matrix has the same symmetry as the coor- 
dinate operator (i.e., a vector that is even under time 
reversal), so it has the formSi 

Z A (x) = |e AM ,|{J M J4C(x), (8.3) 

where £( x ) is a linear combination of & l (x) functions. 
This result has the same form for Tis, but vanishes for 
Tq. The coupling 1)8.3(1 generates a zone-center mixing of 
heavy and ligh t holes, and was proposed independently 
in Refs. H |H1 HJ and H 

The Y interface matrix has the same symmetry as the 
symmetric part of D, so it can be written 

r A ^(x) = ^(x^i - m (x)<M</a 2 - \J 2 ) 

-V3(x)(l-6 Xfl ){JxJ„}. (8.4) 
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This term has not been studied previously for Tg states, 
although the corresponding term for T 6 electrons (a di- 
rect analog of the Darwin term from the Dirac equation) 
is well known 

The r matrix has the form 

r A ^(x) = |e v „|Kx«, (8.5) 

in which x(x) is a linear combination of 9 (x) functions. 
The closely related coupling T XfJ " arising from an external 
electric field was proposed recently in Ref. [l| as a possi- 
ble mechanism for the linear and circular photogalvanic 
effects. This term is relativistic in origin and does not 
occur for Ti§ or IV 

The matrix <f> has the same symmetry as the antisym- 
metric part of D, except that it is odd rather than even 
under time reversal. It can therefore be written as 

* A "(x) = e Xli uW(x)J u + q'(*)4]- (8.6) 

For Ig electrons (where q' is not independent) this is 
the analog of the Rashba spin-splitting effect for low- 
symmetry bulk semiconductors^^ which occurs also in 
heterostructures of cubic semiconductor a 48 i 154 i 164 i 165 due 
to the reduced symmetry at a surface or interface. The 
corresponding Hamiltonian for Eg states was proposed 
in Ref. [5^, and has received renewed attention as the 
valence-band Rashba coupling2L2iiiSiiS& in recent years. 

The coefficients of the various terms involving tp in 
Eq. (|6.3(l have the same form as those already given. The 
symmetry restrictions on the Ta ylor series expansions for 
n^ 1 ' and were given in Ref. Il24l 

B. GaAs/AlAs (001) heterojunction 

The significance of these results is now investigated 
by comparing them with all interface terms of order 
(ka) 2 AV allowed by symmetry for a GaAs/AlAs (001) 
heterojunction, which has the point group Ci v . To bet- 
ter understand this comparison, it is helpful to begin 
by studying the symmetry properties of the linear and 
quadratic response in this system. 

The starting point is the observation that although the 
point group of the heterojunction is Ci v (with the coor- 
dinate origin at an interface As atom), if the reference 
crystal is chosen to be the virtual crystal Alo.5Gao.5As, 
then the perturbation due to the ionic pscudopotcntials 
has a higher symmetry: it transforms according to the 
representation X3 of the Did group. In other words, it 
transforms according to the identity representation Ai 
of the Civ group (none of whose operations change the 
z coordinate), but in addition it has odd parity with re- 
spect to those elements of Did that change the sign of 
z. This occurs because the atomic mole fraction 6^ itself 
transforms as X^(Did), whereas the atomic perturbation 
Aw;" n (x,x') has the site symmetry Ti(Td) or Xi(D 2 d)- 
Therefore, AVi on (x,x / ) transforms as X\ (3X3 — X3 un- 
der the operations of Did- 



TABLE II: Basis functions for Did- The components of a, I, 
and B = ic(P x P) transform as those of J. 

Rep. Basis functions 

Xl 1, P X -+" Py , P Z , J X + Jy , J Z , Pz{Jx Jy}l PzVz, 

P T — P T P T 3 — P T 3 P V 4- P V 

i x-J x 1 y tJ y, ± x^J x ± y tJ y, ± xVx\ ± yry, 

Pr { Jy J z } H~ Py { J z J x } 

Px Py 1 J x J y 1 P% J z 1 P z J 2 1 Px J x ~\~ Py J y 5 
Px J x Py J y 1 Px Vx Py Vy 5 Px {Jy J z\ Py { J z Jx } 
-^3 Pz 1 { Px Py } 1 \ Jx Jy } •> V2 , Pz J z 1 Px Jy Py J x 5 

Px Jy Py J x 1 Px Vy ~\~ Py Vx j Px { Jx J z } ~t~ Py {JyJz\ 

J z > J z •> Pz \ Jx Jy)f P x Jy Py J x j 

Px Jy ~T~ Py J x 1 Px Vy Py Va; , Px { Jx J z } Py { Jy J z } 
X 5 (Px,Py), ({PyPz},{PzPx}), (Jx,~Jy), {Jl,~J% 
(Vx,V y ), ({JyJ*},{JzJx}), (PzJy,-P*Jx), 
(Pz-J^-PzJ!), (Pz{JzJx},Pz{JzJy}), (P.Vy,P,V m ), 
(PyJz,—PxJz), (PxJzyPyJz), (PyJz>~PxJz)i 
[Py Vz, PxVz) , [Py {Jx Jy } , Px { Jx Jy }) 



TABLE III: Basis functions for Civ- The components of cr, 
I, and B = ic(P x P) transform as those of J. 

Rep. Basis functions 

Al 1 , P z , {Px Py } , P x ~t~ Py ) Pz ) {JxJy}, J x "t" Jy , Jz 1 J z , 
Vz 

p>2 j j2 j2 

A 3 Px + Py", {Pz{Px + Py)}, Jx ~ Jy, {Jz(Jx + Jy)} , 

J 3 — J 3 t V X + Vy 

A 4 Pi - Py, {Pz(Px - Py)}, Jx + Jy, {Jz(Jx ~ Jy)} , 

J 3 4. J 3 V — V 

< J X T ,J y , V X V y 



To determine the behavior of the screened potential 
AV(x, x'), note that since the vertex functionsiSi 
and r' 2 ' of the reference crystal transform as Ti(Td), the 
linear response (x, x') transforms as X\ ® X% = X$, 
while the quadratic response V^(x, x') transforms as 
Xi (g) X 3 (g) X 3 — Xi . (In general, all odd-order terms in 
the response transform as X3, while all even-order terms 
transform as X\.) Thus, the total response AF(x,x') = 
V^fajX 1 ) + (x,x') transforms as neither X3 nor X\, 
but as Ai(C2i,). 

If the quadratic response is of the same order as the 
linear response, then one must include all possible invari- 
ants of Ci v that are of order (ka) 2 AV when constructing 
the interface Hamiltonian. Basis functions for Did and 
C 2v are given in Tables ITT1 and IIIII respectively; note that 
both Xi and X% are compatible with Ai. All interface 
invariants of C'i v that are (1) hermitian, (2) time-reversal 
invariant, and (3) of order (ka) 2 AV or less are listed in 
Table ITVI 

The second column in this table lists the symmetry 
of each term under the operations of Did- The third 
column indicates whether the term can be constructed 
as an invariant of Td, treating 0(z) as an invariant of Td- 
(As discussed above, the latter approach can be used for 
all linear- response interface terms.) Note that in the Td 
symmetry analysis, S(z) = dO/dz transforms as z (T15) 
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TABLE IV: Terms in the (001) GaAs/AlAs interface Hamil- 
tonian constructed from invariants of Ci„. 



Term 


Did 


ri(T d ) 


Origin 


15(0) 

{Jl - | J 2 )S{z) 


Xi 


no 


— 


X\ 


no 


— 


{JxJy}S{z) 


A, 


yes 


Z x 


16' (z) 


x 3 


yes 




{Jl - | J 2 )8'{z) 


x :i 


yes 




{J X Jy}S'{z) 


Xi 


no 




V z {P z S{z)} 


Xi 


no 




{PmV»+PyVy)8(z) 


Xi 


no 




{PmVy + P v V a )S{z) 


x 3 


yes 




{PxJx ~ PyJy)S{z) 


Ai 


no 




{P x J y - P y J x )8{z) 


x 3 


yes 




{P X J 3 X - PyJl)5{z) 


Xi 


no 




{P x J S y ~ PyJl)S{ Z ) 


x 3 


yes 





but no 8{z) coupling between the bands 

ri- Xi z , X\ x —Xxy. (8-8) 

Coupling of the latter type only occurs in the quadratic 
response, or in the linear response from terms propor- 
tional to 8'(z) or i{P z 5{z)}. These conclusions agree with 
those of Ref. l6lt which were derived from a model poten- 
tial constructed from a linear superposition of atomic-like 
pseudopotentials. 

Note, however, that these results hold only for an ideal 
heterojunction. If interdiffusion of Ga and Al atoms 
breaks the X3 symmetry of the linear response, then 
there will be 8{z) coupling between the bands (|8.8|l due 
to terms similar to those derived in Eq. i|7.16[l . 

IX. DISCUSSION 



and 5'{z) transforms as z 2 (Ti or r^), whereas in the 
D2d symmetry analysis, 5{z) transforms as z 2 (X\) and 
8' (z) transforms as z (AT 3 ). 

In agreement with the general symmetry properties 
of and derived earlier, Table IIVI shows that 

all linear interface terms [derivable from ^(Td)] trans- 
form as X^,{D2d)- The remaining terms that transform 
as X\ all originate in the quadratic or higher-order re- 
sponse. Thus, if one accounts for the smallness of the 

seven out of the thirteen 



quadratic response (Sec. lIIIBl 
possible interface invariants for (001) GaAs/AlAs het- 
crojunctions can be omitted because they are actually of 
order {ka) 3 AV or higher. This represents a major simpli- 
fication over the general case in which all response terms 
are of the same order. 

For a no-common-atom heterojunction, the results of 
Sec. IVTTT A I remain valid (as they depend only on the 
bulk symmetry), but the linear response now has only 
the symmetry Ai(C2„). Thus the terms labeled X\ in 
Table IIVI now arise (in general) even in linear-response 
theory, due to the renormalization effects described above 
in Eqs. (|7.1rj[) and (|7.17l) . For example, the 8{z) term in 
Eq. H7.16fl generates the term in the first row of Table 
IIVI However, the term in the second row of Table IIVI 
is still zero (within linear response), because W„*,(x) 
in Eq. I|7.16|l couples only states of the same symmetry, 



with W«(x) 
whenever E n = E n i 



<WWnn (x) 



and Wnn 



(x) 



n'n' \ J 



C. T—X coupling 

If one extends the above analysis to the case of in- 
tervalley T—X coupling at a (001) GaAs/AlAs junction, 
it is immediately apparent (because the Z matrix has 
the symmetry of a coordinate matrix) that the linear re- 
sponse produces 5{z) coupling between the bands 



ri-x : 



3z , 



X\ z -X, 



3z , 



X3X -X. 



3yi 



(8.7) 



Since a summary of the main results of this paper 
has already been given in Sec. II CI this section will be 
limited to a discussion of the differences between the 
present theory and previous envelope-function theories 
that have appeared in the literature. The principle dif- 
ferences are that (1) the present Hamiltonian is con- 
structed from atomic pseudopotentials rather than the 
periodic potential of a bulk crystal; (2) the present theory 
is self-consistent, accounting fully for electron-electron 
Coulomb interactions; and (3) the present approach uses 
linear and quadratic response theory to simplify the func- 
tional form of the heterostructure Hamiltonian. 

Starting with point (1), previous envelope- function 

^ c „ r j :: :] 47.48. 49. 50. 51. 52. 53. 54. 55. 56. 57.58. 59. 60. 61. 62. 63. 64. 65. 66. 67. 68. 69. 70. 71. 7i 

were based on model potentials of the form (|5.9|) . which 
is a linear combination of periodic bulk potentials 
multiplied by step-like functions # ; (x). The form of 
9 l {x) near an interface was either specified as part of 
the model (e.g., abrupt step function) or treated as 
unknown — and in principle unknowable, at least within 
the model potential approach. 

Although the present theory led to a similar expression 
(|77TE|) . this was a derived result based on linear response 
theory, in which the functions #'(x) have a known form 
determined by the distribution 0p_ of atoms in the het- 
erostructure. The potential (|7.1b|) includes contributions 
from the "interface" materials (e.g., GaAs and InSb in 
an InAs/GaSb heterostructure), which are often omitted 
in the model-potential approach. Furthermore, for the 
model potential, the coefficients of the S terms in the 
Hamiltonian depend on (i) the values of the bulk poten- 
tials at the reciprocal lattice vectors G; and (ii) whether 
the functions 6 1 (k) are smooth (zero outside the Bril- 
louin zone) or sharp, with some 8 terms vanishing in the 
smooth case^SSiSLSiS 9 . In the present theory these co- 
efficients are determined not by # z (x), but instead by the 
properties of the linear atomic pseudopotentials in a fi- 
nite neighborhood of each G. 

For point (2), the present theory includes long-range 
multipole Coulomb potentials (because the self-energy is 
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nonanalytic at q = G), which were not considered in 
previous studies. These terms have no qualitative effect 
in two-dimensional systems (other than to renormalize 
the band offsets), but they are present in quantum wires 
and dots, even for isovalent heterostructures. 

Finally, for point (3), if the functions 9 (x) are treated 
as unknown in the model potential approach, one has no 
way of knowing how large the various interface terms 
are, and all terms permitted by symmetry should in 
principle be includedp*£i££i£Z- 68 - 69 However, since the lin- 
ear response is dominant (for typical heterostructures) 
in a first-principles theory, one can eliminate all nonlin- 
ear interface band-mixing terms, thereby simplifying the 
Hamiltonian considerably. 

Aside from these differences, the qualitative form of 
the Hamiltonian derived here is very similar to those de- 
rived by Leible r 47 ! 48 for slowly graded heterostructures 
and Takhtamirov and Volkov 65 i 66 i 67 i 68 i 69 for abrupt het- 
erostructures. The basic structure of these Hamiltoni- 
ans is similar because the same type of perturbation 
theory^LSiiiS 7 . was used in their derivation. 

Other differences arise in theories using different ap- 
proximation techniques. The well-known theory of 
BurtSi is based on the Luttinger-Kohn representa- 
tior4£J2I with an energy-dependent approximation sim- 
ilar to Lowdin perturbation theoryiSi used to eliminate 
the coupling to remote bands. Estimation of the error 
involved in this approximation shows that Burt's gen- 
eral theory^ includes all terms of order (ka)AV, but 
retains some terms of order (ka) 2 AV while omitting oth- 
ers. In particular, his results give an essentially correct 
(within the model potential approach) description of the 
^-function term Z and the Rashba term <E>, but omit the 
contributions from remote-band coupling to the terms Y 
and T. 

In Ref. l58l I used Burt's theory, but introduced an 
additional approximation (also adopted in Refs. |S3 and 
16^) whereby the energy eigenvalue in the denomina- 
tor of the energy-dependent effective mass was replaced 
with the position-dependent energy of the bulk valence 
band maximum. This has the effect of replacing the 
Rashba coefficient in Eq. I|6.1U[1 with iD^)(x). In other 
words, the Rashba parameters k' and q' of Eq. (|8.6|) 
were replaced by the Luttinger parameters k and q of 
Eq. 1)8. 2f> . This simplifies applications of the theory be- 
cause k and q are known from bulk magnetoabsorption 
measurements jiSLiS and k can also be estimated from 
71, 72, and 73 ^iSi However, since the true Rashba pa- 
rameters k' and q' cannot be determined from bulk mea- 
surements, this simplification will produce values that 
have the correct order of magnitude but are quantita- 
tively incorrect. 

A similar approximation was used in Refs. 0, l7Ct and 
FH to estimate the (5-function mixing parameter for light 
and heavy holes [see Table IIVI and Eq. I|8.3|l ] from the 
valence-band offset. The original proposal was based on 
the Hbf model^ in which the zone-center Tis Hamilto- 
nian is written as a sum of operators B = i + {I x Iy} 



and F = \ — {I x I y } multiplied by position-dependent 
bulk valence-band energies E v . For a (001) GaAs/AlAs 
heterojunction this yields the interface Hamiltonian 

~{I x I v }(E^-Ef As )6(z), (9.1) 

whereas for an InAs/GaSb junction with a GaAs-like in- 
terface one obtains 

|(1 - 2{I x I y })[E^ As - \{E l : Ks + E^ sh )]S(z). (9.2) 

The corresponding results for Ts are given by the sub- 
stitution {I x Iy} — * \{JxJy}~^ Now the diagonal term 
[proportional to 1 S(z)] in Eq. I|9.2[) is identical (within 
linear response) to the 8(z) term derived above in Eq. 
(|7.16|) . However, the valence-band mixing term derived 
in Eqs. I|6.7|l and l|8.3|) is not related to the valence-band 
offset. Therefore, although the model of Refs. 0,[z3i and 
FH yields a coupling of the correct symmetry and order 
of magnitude, its numerical value is not reliable^ 5 - The 
same conclusion was reached in Ref. |8(| 

In Ref. |6(J, I proposed a smooth model potential^ 
for abrupt heterostructures that yields a valence-band 
mixing determined chiefly by the change in Bloch func- 
tions at an interface. This result is of course not valid 
for general model potentials 60 61 (despite claims to the 
contrar y 56 ! 57 ), and it is not supported by the present 
work. However, in Ref.|6^it was argued that the valence- 
band mixing generated by Bloch-function differences is 
negligible in comparison to that generated by a discon- 
tinuous model potential. Although the latter does not 
appear in the present first-principles theory, its analog is 
the Z term shown in Eq. (|5.6p . The former contribution 

is given by the term proportional to W^} (q) in Eqs. (|C6|) 
and H6.7|) ■ Inspection of these terms shows that both are 
of order (ka)AV. Therefore, neither the model of Ref. l60l 
[which omits the interface contributions from Eq. (|5.5|l ] 
nor the model of Refs. 66 67 68 (which omits the con- 
tribution from Bloch-function differences) gives a correct 
value for the valence-band mixing coefficient. 

In Refs. [12TITMT3 . 163. and|6^ it was proposed that in- 
tervalley Ti~Xi z and X\ x -X\ y coupling should be pro- 
portional to S(z) and of the same order of magnitude as 
Ti-X 3z , Xi z -X 3z , and X 3x -X 3y coupling. However, as 
shown above in Sec. IVIII CI the former types of coupling 
should generally be substantially weaker than the latter 
for i deal interfaces. The same conclusion was reached in 
Ref. ll3Ct where it was shown that despite the very small 
magnitude of the direct Ti~Xi z coupling, there is a net 
effective T±—Xi z coupling (as revealed by the a nticrossing 
of superlattice subbands in Fig. 1 of Ref. Il30|) generated 
by ri-A^, X\ z -X 3zi and k • p couplings that is only a 
factor of two or three smaller than the net Ti~X 3z cou- 
pling. 

However, in Refs. Il2ll3llill it was found that X x -X v 
mixing experiments could not adequately be explained 
without including (5-function Xi x -Xi y coupling of com- 
parable magnitude to X 3x -X 3y coupling. Nevertheless, 
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since Refs. mm did not test the effect of S'(z) X\ x - 
Xi y coupling, it is not yet clear that their experiments 
require such a large S(z) coupling. If 8'(z) or i{P z 5(z)} 
coupling is unable to explain their results, then their 
data may indicate the presence of nonidcal interfaces, as 
discussed in Sec. IVIII CI Another possible explanation 
would be spin-orbit coupling, which generates a direct 
6(z) coupling between Xq x and X§ y even for ideal inter- 
faces. 

Two important physical effects not considered in this 
paper are the contributions from alloying (beyond the 
virtual crystal approximation) and from strain (in the 
bulk or at an interface). The former effect is in prin- 
ciple already encompassed by the present formalism 
(for lattice-matched systems in which linear response 
is dominant), although for practical applications one 
would need to perform a detailed analysis of the sig- 
nificance of different intervalley mixing effectsiiSl The 
contribution from strain would require a nontrivial ex- 
tension of this theory, but since an ab initio linear- 
response approach to strain has already been success- 
fully applied to In . 5 3Ga .4 7 As/InP, 114 i 116 Si/Ge, 119 i 120 
and InAs/GaSbiSS within LDA, the general perturba- 
tion scheme used in this paper should work for lattice- 
mismatched systems also. 
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APPENDIX A: IMAGINARY PART OF THE 
SELF-ENERGY 

Luttingei— i has shown for homogeneous systems that 
£W vanishes at the Fermi surface to all orders in per- 
turbation theory. The generalization of his result to the 
case of inhomogeneous systems is £W(cj±iO + ) — =p J(u>), 
where u> is real andi£iiiZ£ 

C+{uj - fi + ) 2 , uj> [i + 
J(u) = { 0, ^- <uj <fi + (Al) 

C^(oj — ^i-) 2 , ijJ < [i- 

for & near /i± = fi ± \E g . Here /i is the chemical po- 
tential (in the limit of zero temperature), E g is the en- 
ergy gap, and C± > is independent of u>. The mag- 



nitude of C± may be estimated from the calculations of 
Quinn and FerrelUS for a degenerate homogeneous elec- 
tron gas. Their results may be expressed as (C±) = 1/Ei, 
where 
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,173.174 



in which ep is the Fermi energy and co p is the plasma fre- 
quency. If one treats semiconductors with the diamond or 
zinc-blende structure as a homogeneous gas with a den- 
sity of eight valence electrons per primitive unit cell, one 
obtains values of E\ ~ 126 eV for GaAs and E\ ~ 140 eV 
for Si. This is very large in comparison to the energy gap 
and suggests that will have a negligible influence on 
the band structure in typical semiconductor heterostruc- 
tures (although it has an important qualitative effect in 
producing a finite quasiparticlc lifetime) . 

Since a real semiconductor is not a degenerate homo- 
geneous electron gas, one may question whether this esti- 
mate is reliable. However, the calculations of Fleszar and 
HankeiS for Si yield E\ > 100 eV (estimated from Fig. 
3 of Ref . Il75[) , with some asymmetry between electrons 
and holes. Thus it is reasonable in heterostructures to as- 
sume that E\ » E g . Since (uj — [i±) 2 is of order (At^) 2 , 
this means that £W is negligible under the present per- 
turbation scheme. Thus it will, for the most part, not be 
considered explicitly in the envelope-function equations 
derived here. However, the leading contribution from 
is noted in Eq. I|5.3I) so that it may be included if desired. 



APPENDIX B: CUTOFF FUNCTION 

This appendix considers possible definitions of the Bril- 
louin zone cutoff function B(k) introduced in Eq. (|5.7|l . 
The most obvious choice [from Eq. I|3.19a|l ] would be 



B(k) 



i, k e n* 
o, k 4 n 







(Bl) 



which was used previously in Refs. |4y and |5(J How- 
ever, the sharp cutoff at the boundary of has the 
undesirable effect of producing Gibbs oscillations in x 
space. Other possibilities that eliminate this problem 
are B(k) = [1 + f3{ka) 2 ] exp[-/?(/ca) 2 ] and fl(k) = 
exp[— f3(ka) 4 ], in which (3 is some number of order 1. 
These choices of B(k) are smooth, spherically symmet- 
ric, and introduce a negligible error of order (ka^AV. 

For some applications one need not introduce any cut- 
off function at all, as an alternative power-series approx- 
imation for 6 a (k) is often more convenient. This approx- 
imation is discussed in Sec. IVII Bl 
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APPENDIX C: RENORMALIZED EFFECTIVE-MASS PARAMETERS 

This appendix presents some details of the perturbation theory used in Sec. lVIl For states m, m! € A, the effective 
Hamiltonian H is given bji^LM 

B / -, , \ „ B B / ^ 



H mm > - H mm , + ^E H' mi H' lm , ( — — + — — - J + - ^ H' mi H' i:j H', jml \-^r— 



r, mi irrv I P , P / 1 o »J \ E E E i 

1 3 

B A 

2 



_ ^ ' ^ ' I mm" m"i im' _|_ rL rni n im" m"m' j _|_ Q ^Jj'^^ (CI) 

i m" x ' 



in which i5 m ^ = E m — Ei . This expression can be used to derive most of the terms in the effective- mass Hamiltonian 
(|6.2|) . The coefficients of the quadratic, cubic, and quartic dispersion terms of the reference crystal are given by 

= d x s> + \ E <A (-^- + ^rX ( C2 ) 



B / \ ^ B B 



rin' nn' 



\ E^ntAv + ( pTT + ) + J E E ^"Wn' ( 7F> F , F , 



1 \ " \ W 7r nn" n n"i 7r in' , 7V ni 7V m" 7T n"n' \ fr<1\ 

2^1 E m E^ % + E^E^t Y (C3) 



i n" 



Q\[IK,V ^.XflKV . 1 \ / f)Au fsKV I _A fillKV . AAuK w ~> [ _ i ' 

nn' o / -^in' T n ni^in> + °ni "m'Jl'S "pi 

B B , 
' rj / ^ / j \ ni "ij jn' ' ni ij jn' ' ni ij jn' ) \ j 

B A 



1 j 

^m'^n^^m' t W L 'n"rm' + W^m' _|_ U ni/ K in ,f7t n"n' ^ 7r ni LJ in" 7r n /, n' ^ ^ ni 1 * %n" U n' 1 n 1 



+ 0(tt 4 ). (C4) 



In the last expression, the symbol 0(ir 4 ) denotes terms of the fourth order in the kinetic momentum ir. These cannot 
be obtained from Eq. (|(TJl . but they can be derived easily from Eq. (B15e) on p. 205 of Ref . IsH 

(2) 

The term W^,(q) is a rcnormalized second-order contribution to the band offsets, given by 



W {2) 

nn 



> (x) = Wfj,(x) + + J-^) tf(x) + A„^( 1 )(x)][M/i 1 , ) (x) + A^^C^Cx)], (C5) 



where (q) and W^^^) were defined in Eqs. H5.11|l and (|5 . 1 5|) . The only contribution from ip^ to be included 
here is the linear interface dipole term l|5.14|l for heterovalent systems; but note that this contribution is zero in the 
GW approximation 94,95 and in density- functional theory^SS (since A lm / = 6 nn i in these cases). 

The remaining terms in (|6.2I) are renormalized versions of the functions J^ n , (q) , MJ^, (q) , and R^t, (q) defined in 
Eqs. (|5~TT|l and FTTHr . 



JL> (q) = JL> (q) + \ E <i w ® (q) f 7^ + tM • 



(C6) 
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M, 



A// 



(q) = M^(q) + 1 E (4- + + \ E j&fo) f ^- + f-) 



Iee^-^w 



EniE n j 



B A 



Jee 



tt a „7r M „-W /(1 , ) (q) 7r A -7r M „T^ ( ^ ,(qV 
"to " n"i " in' \HV i "m"«n" n"n' 



(C7) 



+ ^EE ^< (oK* (^r + ^7:) 



2 .. V E n ^E n ff i E n 'iE n "i 

i n 



<n ' 1 ^^"" (q)7r """' ). (C8) 

Note that M^,(q) ^ M^,(q), in contrast to Eq. flTTSj l. 
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